VPPO Model - a chamber111 Collection

updated 11 days ago

State-of-the-art models for multimodal reasoning, fine-tuned with VPPO. They achieve superior performance by focusing on critical visual tokens.

chamber111/VPPO-7B

Image-Text-to-Text • 8B • Updated 8 days ago • 42 • 4

chamber111/VPPO-32B

33B • Updated 8 days ago • 34 • 2

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published 14 days ago • 35