Joint Level Generation and Translation Using Gameplay Videos