Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios By pratikbhavsar and 1 other β’ Feb 12 β’ 25
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr β’ Feb 11 β’ 67
**MindBot Ultra β Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation** By TheMindExpansionNetwork β’ Feb 11 β’ 3
Announcing the winners of the Frugal AI Challenge π± By frugal-ai-challenge and 1 other β’ Feb 11 β’ 9
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy β’ Feb 11 β’ 66
From Llasa to Llasagna π: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other β’ Feb 11 β’ 32
Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect By atlasia and 2 others β’ Feb 10 β’ 14
Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems By Navid-AI and 1 other β’ Feb 9 β’ 14
Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501 By ruslanmv β’ Feb 8 β’ 3