zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step320 8B • Updated about 2 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step160 8B • Updated about 2 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step128 8B • Updated about 2 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step64 8B • Updated about 2 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step272 8B • Updated about 5 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step256 8B • Updated about 5 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step192 8B • Updated about 6 hours ago
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step80 8B • Updated about 23 hours ago • 1
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step160 8B • Updated 1 day ago • 1
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step240 8B • Updated 1 day ago • 1
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step304 8B • Updated 1 day ago • 1
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step256 8B • Updated 1 day ago • 1
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step144 8B • Updated 1 day ago • 74
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_random-LR2.0e-5-EPOCHS3-LF Image-to-Text • 8B • Updated 3 days ago • 23
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_reject-LR2.0e-5-EPOCHS3-LF Image-to-Text • 8B • Updated 3 days ago • 24
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_random-LR2.0e-5-EPOCHS2-LF-10000000 Updated 3 days ago
zhangchenxu/Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp12_nothink-GRPO-01_step256 8B • Updated May 14 • 15