VPPO Data Collection Official training and evaluation datasets for the VPPO project. • 4 items • Updated 8 days ago • 2
VPPO Model Collection SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 3 items • Updated 8 days ago • 3
Spotlight on Token Perception for Multimodal Reinforcement Learning Paper • 2510.09285 • Published 11 days ago • 35
Native Hybrid Attention for Efficient Sequence Modeling Paper • 2510.07019 • Published 13 days ago • 16
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration Paper • 2509.14760 • Published Sep 18 • 52
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published Aug 13 • 53