Update README.md
README.md
CHANGED
@@ -31,7 +31,7 @@ As a result, VPPO-32B demonstrates significant performance improvements over str
 ### Model Sources
 
 - **Repository:** [`VPPO-RL`](https://github.com/huaixuheqing/VPPO-RL)
-- **Paper:** `
+- **Paper:** [`2510.09285`](https://arxiv.org/abs/2510.09285)
 -
 ## Training Details
 
@@ -77,11 +77,14 @@ If you use this model in your work, please cite our paper:
 
 **BibTeX:**
 
-
-@
-
-
-
-
+```bibtex
+@misc{huang2025spotlighttokenperceptionmultimodal,
+title={Spotlight on Token Perception for Multimodal Reinforcement Learning},
+author={Siyuan Huang and Xiaoye Qu and Yafu Li and Yun Luo and Zefeng He and Daizong Liu and Yu Cheng},
+year={2025},
+eprint={2510.09285},
+archivePrefix={arXiv},
+primaryClass={cs.CV},
+url={https://arxiv.org/abs/2510.09285},
 }
-```
+```