-
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Paper • 2505.14625 • Published • 13 -
1
TinyV
💬Verify model answers against ground truth
-
zhangchenxu/TinyV-Qwen3-1.7B
Text Generation • 2B • Updated • 8 -
zhangchenxu/TinyV-Qwen3-1.7B-Think
Text Generation • 2B • Updated • 7 • 1
Zhangchen Xu PRO
zhangchenxu
AI & ML interests
LLM Data, Alignment, Post-Training, Safety
Recent Activity
updated
a model
4 days ago
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step312
published
a model
4 days ago
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step312
updated
a model
4 days ago
zhangchenxu/RB-Qwen2.5-VL-3B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step288