Datasets, and model checkpoints of our Group Relative Reward Model (GRRM) framework
Sen Yang PRO
double7
AI & ML interests
None yet
Recent Activity
updated
a model about 2 hours ago
double7/Qwen2.5-7B-GRRM updated
a collection
1 day ago
GRRM updated
a collection
1 day ago
GRRM Organizations
None yet