Xiyao Wang's picture

5 36 7

Xiyao Wang

russwang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

upvoted a paper 23 days ago

Token-Level LLM Collaboration via FusionRoute

authored a paper 2 months ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

View all activity

Organizations

upvoted a paper 5 days ago

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

Paper • 2601.18217 • Published 6 days ago • 9

upvoted a paper 23 days ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published 24 days ago • 39

upvoted a paper 2 months ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Paper • 2511.21662 • Published Nov 26, 2025 • 11

upvoted 6 papers 3 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 82

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 59

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 32

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 18

PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection

Paper • 2510.23594 • Published Oct 27, 2025 • 6

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

upvoted 4 papers 4 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 59

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28, 2025 • 48

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29, 2025 • 19

upvoted a collection 4 months ago

LLaVA-OneVision-1.5

https://github.com/EvolvingLMMs-Lab/LLaVA-OneVision-1.5 • 9 items • Updated Oct 21, 2025 • 19

upvoted 2 papers 5 months ago

CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning

Paper • 2507.00045 • Published Jun 23, 2025 • 1

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1, 2025 • 34

upvoted a collection 5 months ago

LLaVA-Critic-R1

6 items • Updated Sep 3, 2025 • 2

upvoted a paper 5 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

upvoted 2 papers 7 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22, 2025 • 66