
Jiarui Yao

FlippyDora

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

jrtmp/dapo-math-17k

published a dataset 1 day ago

jrtmp/dapo-math-17k

updated a dataset 1 day ago

jrtmp/math500


Organizations

updated a dataset 1 day ago

jrtmp/dapo-math-17k

Viewer • Updated 1 day ago • 1.79M

published a dataset 1 day ago

jrtmp/dapo-math-17k

Viewer • Updated 1 day ago • 1.79M

updated a dataset 1 day ago

jrtmp/math500

Viewer • Updated 1 day ago • 500

published a dataset 1 day ago

jrtmp/math500

Viewer • Updated 1 day ago • 500

upvoted a paper 7 days ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 9 days ago • 63

upvoted 2 papers 14 days ago

HippoCamp: Benchmarking Contextual Agents on Personal Computers

Paper • 2604.01221 • Published 15 days ago • 29

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Paper • 2603.26653 • Published 20 days ago • 18

updated a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step260

9B • Updated 29 days ago • 157

published a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step260

9B • Updated 29 days ago • 157

updated a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step240

9B • Updated 29 days ago • 11

published a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step240

9B • Updated 29 days ago • 11

updated a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step160

9B • Updated 29 days ago • 11

published a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step160

9B • Updated 29 days ago • 11

updated a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step80

9B • Updated 29 days ago • 10

published a model 29 days ago

rb-dev/v-rubrics_opd-grpo_qwen3-vl-8b-instruct_g5-step80

9B • Updated 29 days ago • 10

updated a dataset 29 days ago

rb-dev/rubrics_train_data

Viewer • Updated 29 days ago • 101k • 14

upvoted a paper about 1 month ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published Mar 14 • 10

submitted a paper to Daily Papers about 1 month ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published Mar 14 • 10

updated a model about 1 month ago

rb-dev/Qwen3-VL-8B-Instruct-sft-epoch-3

9B • Updated Mar 9

published a model about 1 month ago

rb-dev/Qwen3-VL-8B-Instruct-sft-epoch-3

9B • Updated Mar 9
