-
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Paper • 2505.14625 • Published • 13 -
1
TinyV
💬Verify model answers against ground truth
-
zhangchenxu/TinyV-Qwen3-1.7B
Text Generation • 2B • Updated • 7 -
zhangchenxu/TinyV-Qwen3-1.7B-Think
Text Generation • 2B • Updated • 8 • 1
Zhangchen Xu PRO
zhangchenxu
AI & ML interests
LLM Data, Alignment, Post-Training, Safety
Recent Activity
published
a model
about 11 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step80
updated
a model
about 11 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step80
updated
a model
about 13 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step160