Gaia2 Agents Evaluation Leaderboard 🐠 (Runtime error, 24): Display and submit model evaluation results on a leaderboard.
Nexus Function Calling Leaderboard 🐠 (Running, 95): Display benchmark results for models on various tasks.