RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO Paper • 2605.15190 • Published 4 days ago • 9
ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both Paper • 2605.15198 • Published 4 days ago • 17
Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation Paper • 2605.04128 • Published 13 days ago • 17
A Benchmark for Interactive World Models with a Unified Action Generation Framework Paper • 2605.03941 • Published 13 days ago • 5
Ex0bit/Gemma4-26B-A4B-PRISM-PRO-DQ-GGUF Image-Text-to-Text • 25B • Updated Apr 11 • 9.48k • 71
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU Paper • 2604.05091 • Published Apr 6 • 46