Lost in Stories: Consistency Bugs in Long Story Generation by LLMs • Paper • 2603.05890 • Published 8 days ago • 81
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising • Paper • 2603.08703 • Published 4 days ago • 29
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing • Paper • 2603.08589 • Published 4 days ago • 34
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory • Paper • 2603.03269 • Published 10 days ago • 52
Believe Your Model: Distribution-Guided Confidence Calibration • Paper • 2603.03872 • Published 10 days ago • 36
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence • Paper • 2603.07660 • Published 5 days ago • 77
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets • Paper • 2602.22207 • Published 16 days ago • 42
Enhancing Spatial Understanding in Image Generation via Reward Modeling • Paper • 2602.24233 • Published 14 days ago • 52
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation • Paper • 2602.24286 • Published 14 days ago • 86
The Trinity of Consistency as a Defining Principle for General World Models • Paper • 2602.23152 • Published 15 days ago • 196
MOSPA: Human Motion Generation Driven by Spatial Audio • Paper • 2507.11949 • Published Jul 16, 2025 • 25
Seq vs Seq: An Open Suite of Paired Encoders and Decoders • Paper • 2507.11412 • Published Jul 15, 2025 • 31
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering • Paper • 2507.11527 • Published Jul 15, 2025 • 35
MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding • Paper • 2507.12463 • Published Jul 16, 2025 • 27
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs • Paper • 2507.09477 • Published Jul 13, 2025 • 88
SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories? • Paper • 2507.12415 • Published Jul 16, 2025 • 43