Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
Paper
•
2510.08470
•
Published
•
1
Cambridge NLIP Multimodal BabyLM 2025: decoder with token-wise dynamic gating, feature modulation, and channel attention