4 5 1

xzxuan

AI & ML interests

None yet

Recent Activity

new activity 16 days ago

RLinf/WideSeek-R1-test-data:Update README.md

new activity 16 days ago

RLinf/WideSeek-R1-test-data:Update README.md

updated a dataset 16 days ago

RLinf/Wiki-2018-Corpus

View all activity

Organizations

New activity in RLinf/WideSeek-R1-test-data 16 days ago

Update README.md

#3 opened 16 days ago by

xzxuan

Update README.md

#2 opened 16 days ago by

xzxuan

updated 2 datasets 16 days ago

RLinf/Wiki-2018-Corpus

Updated 16 days ago • 2.68k

RLinf/WideSeek-R1-train-data

Preview • Updated 16 days ago • 110 • 2

updated a model 16 days ago

RLinf/WideSeek-R1-4b

Text Generation • 4B • Updated 16 days ago • 60 • 2

upvoted 2 papers about 2 months ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 19

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published Feb 4 • 98

updated a collection about 2 months ago

WideSeek-R1

Collection

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning • 5 items • Updated 16 days ago

published a dataset about 2 months ago

RLinf/Wiki-2018-Corpus

Updated 16 days ago • 2.68k

published a model about 2 months ago

RLinf/WideSeek-R1-4b

Text Generation • 4B • Updated 16 days ago • 60 • 2

published a dataset about 2 months ago

RLinf/WideSeek-R1-train-data

Preview • Updated 16 days ago • 110 • 2

upvoted a paper 4 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 109

New activity in RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood 6 months ago

Update README.md

#2 opened 6 months ago by

HillFir

New activity in RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood 6 months ago

Update README.md

#2 opened 6 months ago by

HillFir

published a dataset 6 months ago

xzxuan/VS-Bench

Updated Sep 23, 2025 • 4

published 2 models 7 months ago

RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 8

RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 6

updated a model 7 months ago

RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 8

xzxuan

AI & ML interests

Recent Activity

Organizations

xzxuan's activity

Update README.md

Update README.md

Update README.md

Update README.md