1 17 11

Shuo Yang

ShuoY

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

Look-Back: Implicit Visual Re-focusing in MLLM Reasoning

authored a paper 4 days ago

CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step

authored a paper 4 days ago

Reinforcement Learning with Inverse Rewards for World Model Post-training

View all activity

Organizations

None yet

authored 5 papers 4 days ago

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 13 days ago • 8

commented a paper 4 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 16 days ago • 314 •

upvoted 2 papers 4 days ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 26 days ago • 148

Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells

Paper • 2603.25240 • Published 10 days ago • 75

upvoted a paper 5 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 16 days ago • 314

upvoted 2 papers about 1 month ago

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 184

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published Feb 27 • 58

upvoted a collection 6 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 684

updated a dataset 9 months ago

ShuoY/Look-Back-eval

Preview • Updated Jul 5, 2025 • 9

updated a model 9 months ago

ShuoY/Solution-back-7B

8B • Updated Jul 4, 2025 • 5 • 1

published a dataset 9 months ago

ShuoY/Look-Back-eval

Preview • Updated Jul 5, 2025 • 9

updated a model 9 months ago

ShuoY/Semantic-back-7B

8B • Updated Jul 4, 2025 • 10 • 1

published 2 models 9 months ago

ShuoY/Solution-back-7B

8B • Updated Jul 4, 2025 • 5 • 1

ShuoY/Semantic-back-7B

8B • Updated Jul 4, 2025 • 10 • 1

upvoted 2 papers 10 months ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5, 2025 • 135

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2, 2025 • 48

Shuo Yang

AI & ML interests

Recent Activity

Organizations

ShuoY's activity