Submitted by Perry the Platypus 23 Watch Before You Answer: Learning from Visually Grounded Post-Training Natural and Artificial Intelligence Lab 0 1