Submitted by taesiri 117 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency · 61 authors 4
Submitted by taesiri 34 Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation · 9 authors 2
Submitted by Kaiyue 20 T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation · 5 authors 20 2
Submitted by Ironieser 19 MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs · 6 authors 3
Submitted by BAOLONGZHANSHEN 17 Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning · 13 authors 2
Submitted by mbur 15 Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling · 12 authors 45 9
Submitted by Wyattz23 9 PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs · 5 authors 36 3
Submitted by omidgh 6 MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment · 11 authors 2
Submitted by taesiri 4 ST-Raptor: LLM-Powered Semi-Structured Table Question Answering · 9 authors 11 2
Submitted by ControlNet 2 Explain Before You Answer: A Survey on Compositional Visual Reasoning · 13 authors 4 2
Submitted by Hecheng0625 2 TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling · 6 authors 97 2
Submitted by taesiri 1 Neither Valid nor Reliable? Investigating the Use of LLMs as Judges · 4 authors 2
Submitted by stefan-it 1 German4All - A Dataset and Model for Readability-Controlled Paraphrasing in German · 6 authors 1 5
Submitted by RuijieZhu 1 MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting · 8 authors 8 2
Submitted by tristan-deep - Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing · 3 authors 1 2
Submitted by stefanos50 - REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework · 2 authors 0 2
Submitted by dipta007 - If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition · 2 authors 0 2