chamber111
/

VPPO-32B

visual-reasoning

Model card Files Files and versions

chamber111 commited on 9 days ago

Commit

ac17eae

·

verified ·

1 Parent(s): 136290d

Update README.md

Files changed (1) hide show

README.md +11 -8

README.md CHANGED Viewed

@@ -31,7 +31,7 @@ As a result, VPPO-32B demonstrates significant performance improvements over str
 ### Model Sources
 - **Repository:** [`VPPO-RL`](https://github.com/huaixuheqing/VPPO-RL)
-- **Paper:** `[Please Fill In: Link to the arXiv paper]`
 -
 ## Training Details
@@ -77,11 +77,14 @@ If you use this model in your work, please cite our paper:
 **BibTeX:**
-<!-- ```bibtex
-@article{yourname2025vppo,
-  title={Spotlight on Token Perception for Multimodal Reinforcement Learning},
-  author={[Please Fill In: Authors of the paper]},
-  journal={arXiv preprint arXiv:2510.XXXXX},
-  year={2025}
 }
-``` -->

 ### Model Sources
 - **Repository:** [`VPPO-RL`](https://github.com/huaixuheqing/VPPO-RL)
+- **Paper:** [`2510.09285`](https://arxiv.org/abs/2510.09285)
 -
 ## Training Details
 **BibTeX:**
+```bibtex
+@misc{huang2025spotlighttokenperceptionmultimodal,
+      title={Spotlight on Token Perception for Multimodal Reinforcement Learning},
+      author={Siyuan Huang and Xiaoye Qu and Yafu Li and Yun Luo and Zefeng He and Daizong Liu and Yu Cheng},
+      year={2025},
+      eprint={2510.09285},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2510.09285},
 }
+```