Rethinking State Tracking in Recurrent Models Through Error Control Dynamics Paper • 2605.07755 • Published 10 days ago • 23
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published Jan 4 • 46
RefineBench: Evaluating Refinement Capability of Language Models via Checklists Paper • 2511.22173 • Published Nov 27, 2025 • 15
Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation Paper • 2510.19592 • Published Oct 22, 2025 • 13
mirlabcollaboration/v1-internvl2.5-fix-image-size-448-7B-checkpoint-22000 8B • Updated Aug 12, 2025 • 1
mirlabcollaboration/v1-internvl2.5-fix-image-size-448-7B-checkpoint-22000 8B • Updated Aug 12, 2025 • 1
view article Article Efficient MultiModal Data Pipeline +3 ariG23498, lusxvr, andito, sergiopaniego, pcuenq • Jul 8, 2025 • 70
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Paper • 2507.07990 • Published Jul 10, 2025 • 45
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper • 2506.23918 • Published Jun 30, 2025 • 90
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics Paper • 2506.00070 • Published May 29, 2025 • 29