CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation Paper β’ 2601.10061 β’ Published 5 days ago β’ 28