ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning • Paper • 2510.01010 • Published Oct 1
XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models • Paper • 2510.15148 • Published Oct 16