Xiaoyang Cao

Sean13

https://xiaoyangcao1113.github.io/

AI & ML interests

RLHF, Deep Reinforcement Learning

Recent Activity

upvoted a paper 15 days ago

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

upvoted a paper 15 days ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

upvoted a paper 16 days ago

Latent Collective Preference Optimization: A General Framework for Robust LLM Alignment

Organizations

None yet

upvoted 2 papers 15 days ago

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

Paper • 2510.06710 • Published 16 days ago • 36

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published 20 days ago • 91

upvoted a paper 16 days ago

Latent Collective Preference Optimization: A General Framework for Robust LLM Alignment

Paper • 2509.24159 • Published 25 days ago • 1

upvoted a paper 5 months ago

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Paper • 2506.02387 • Published Jun 3 • 58