Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models Paper • 2601.07372 • Published Jan 12 • 47
REVERE: Reflective Evolving Research Engineer for Scientific Workflows Paper • 2603.20667 • Published 4 days ago • 11
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost Paper • 2603.21383 • Published 3 days ago • 13
Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2603.18118 • Published 7 days ago • 10
Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels Paper • 2603.22276 • Published 1 day ago • 9
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models Paper • 2603.19028 • Published 6 days ago • 16
BubbleRAG: Evidence-Driven Retrieval-Augmented Generation for Black-Box Knowledge Graphs Paper • 2603.20309 • Published 6 days ago • 18
Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection Paper • 2603.21944 • Published 2 days ago • 25
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models Paper • 2603.21341 • Published 3 days ago • 22
mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 2 days ago • 31
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis Paper • 2603.20278 • Published 8 days ago • 62
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 2 days ago • 40
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 11 days ago • 67
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding Paper • 2603.22285 • Published 1 day ago • 45
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 3 days ago • 63
Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality Paper • 2603.17512 • Published 7 days ago • 3
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders Paper • 2603.19209 • Published 6 days ago • 4