World Action Models: The Next Frontier in Embodied AI Paper • 2605.12090 • Published 6 days ago • 62
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Paper • 2604.11297 • Published Apr 13 • 143
OpenMOSS-Team/MOSS-VL-Instruct-0408 Video-Text-to-Text • 11B • Updated 25 days ago • 3.74k • 93
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning Paper • 2603.04918 • Published Mar 5 • 56
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper • 2602.10934 • Published Feb 11 • 49