Hybrid Architectures for Language Models: Systematic Analysis and Design Insights Paper • 2510.04800 • Published 23 days ago • 36
Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models Paper • 2510.11057 • Published 16 days ago • 30
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published Jul 14 • 70