·
AI & ML interests
machine learning
Organizations
None yet
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1_pi2
2B
•
Updated
•
1
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1209
2B
•
Updated
•
1
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi2
2B
•
Updated
•
2
ypwang61/One-Shot-RLVR-Llama3.2-3B-Instruct-pi1_pi13
4B
•
Updated
•
1
ypwang61/One-Shot-RLVR-Llama3.2-3B-Instruct-1.2k-dsr-sub
4B
•
Updated
•
1
ypwang61/One-Shot-RLVR-Llama3.2-3B-Instruct-pi1
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-1.2k-dsr-sub
Text Generation
•
2B
•
Updated
•
1
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-16-shot
Text Generation
•
2B
•
Updated
•
5
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-4-shot
Text Generation
•
2B
•
Updated
•
1
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-pi1
Text Generation
•
2B
•
Updated
•
4
ypwang61/One-Shot-RLVR-Qwen2.5-7B-1.2k-dsr-sub
Text Generation
•
8B
•
Updated
•
1
ypwang61/One-Shot-RLVR-Qwen2.5-7B-pi1
Text Generation
•
8B
•
Updated
•
2
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-1.2k-dsr-sub
Text Generation
•
8B
•
Updated
•
6
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-7.5k-MATH
Text Generation
•
2B
•
Updated
•
31
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-1.2k-dsr-sub
Text Generation
•
2B
•
Updated
•
4
ypwang61/intermediate-qwen25-7b-step300
8B
•
Updated
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1_pi13
Text Generation
•
2B
•
Updated
•
3
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1
Text Generation
•
2B
•
Updated
•
4
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi13
Text Generation
•
2B
•
Updated
•
2
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1_pi13
Text Generation
•
8B
•
Updated
•
3
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1
Text Generation
•
8B
•
Updated
•
2
7B
•
Updated
•
1
•
1
ypwang61/negCLIPLoss_NormSim