AIMO AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 708 • 350 Running 421 Reward Bench Leaderboard 📐 421 Explore RewardBench model rankings and scores KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
ARC 01-ai/Yi-Coder-9B-Chat Text Generation • 9B • Updated Sep 12, 2024 • 7.58k • 212 AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 708 • 350
AIMO AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 708 • 350 Running 421 Reward Bench Leaderboard 📐 421 Explore RewardBench model rankings and scores KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
ARC 01-ai/Yi-Coder-9B-Chat Text Generation • 9B • Updated Sep 12, 2024 • 7.58k • 212 AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 708 • 350