Qwen2.5-VL-3B and 7B models trained with PC-GRPO, from the paper "Puzzle Curriculum GRPO for Vision-Centric Reasoning"
- armenjeddi/PCGRPO-Qwen2.5-VL-3B-Jigsaw-Base-plus-curriculum-plus-CARE
  4B • Updated • 21
- armenjeddi/PCGRPO-Qwen2.5-VL-3B-MixPuzzles-Base-plus-curriculum-plus-CARE
  4B • Updated • 5
- armenjeddi/PCGRPO-Qwen2.5-VL-7B-Jigsaw-Base
  8B • Updated • 7
- armenjeddi/PCGRPO-Qwen2.5-VL-7B-Jigsaw-Base-plus-CARE
  8B • Updated • 21