FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 6 days ago • 85
ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget Paper • 2604.01195 • Published 13 days ago • 3
Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published 8 days ago • 35
SWE-Next: Scalable Real-World Software Engineering Tasks for Agents Paper • 2603.20691 • Published 24 days ago • 10
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published Feb 5 • 36
FinMMEval Lab @CLEF'2026 Collection Training datasets for FinMMEval Lab @CLEF'2026 • 12 items • Updated 23 days ago • 9
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 74
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 55
MrLight/general-reasoner-megamath-fineweb-1014-filter-top40-new Viewer • Updated Nov 4, 2025 • 1.1M • 9