qgallouedec
·
AI & ML interests
None yet
Recent Activity
Organizations
qgallouedec/test-grpo-vlm-log-completions
Viewer
• Updated • 435 • 140
qgallouedec/llama_star_formatted
Viewer
• Updated • 7.21k • 10
qgallouedec/deepmath-completions-logs2
Viewer
• Updated • 48 • 63
qgallouedec/deepmath-completions-logs
Viewer
• Updated • 232 • 54
• 1
qgallouedec/Dolci-Think-DPO-7B
Viewer
• Updated • 150k • 15
Viewer
• Updated • 59.4k • 401
qgallouedec/human_gene_interaction_qa_v2
Viewer
• Updated • 79.2k • 12
qgallouedec/human_gene_interaction_qa
Viewer
• Updated • 1.84M • 14
Viewer
• Updated • 2.82M • 863
Viewer
• Updated • 148k • 65
• 1
Viewer
• Updated • 1.18k • 8
qgallouedec/OpenMathReasoning
Viewer
• Updated • 10k • 37
qgallouedec/math-lvl3to5-8k
Viewer
• Updated • 8.52k • 14
Viewer
• Updated • 900 • 8
• 1
qgallouedec/rick-physics-grpo
Viewer
• Updated • 1.79k • 22
• 1
Viewer
• Updated • 1.18k • 22
• 3
qgallouedec/physics-problems
Viewer
• Updated • 247 • 9
qgallouedec/rick-teaches-math
Viewer
• Updated • 6.8k • 17
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
• Updated • 16.4k • 12
• 3
Viewer
• Updated • 41.2k • 22
• 3
qgallouedec/ultrafeedback-prompt
Viewer
• Updated • 60.9k • 16
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
• Updated • 16.6k • 9
qgallouedec/lm-human-preferences-descriptiveness
Viewer
• Updated • 6.26k • 15
qgallouedec/lm-human-preferences-sentiment
Viewer
• Updated • 6.26k • 16
qgallouedec/tldr-preference
Viewer
• Updated • 179k • 7
Viewer
• Updated • 130k • 17
qgallouedec/hh-rlhf-helpful-base
Viewer
• Updated • 46.2k • 6
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
• Updated • 46.2k • 13
qgallouedec/suap_essentials
Viewer
• Updated • 30 • 15
Viewer
• Updated • 270 • 5