🎚️ Batch Normalization — Quand ton réseau a besoin de chill pills ! 😤➡️😌 By RDTvlokip • 11 days ago • 2
🎚️ Batch Normalization — When your neural network needs anger management! 😤➡️😌 By RDTvlokip • 11 days ago • 2
Australian-made LLM beats OpenAI and Google at legal retrieval By isaacus and 2 others • 11 days ago • 25
Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes By nvidia and 1 other • 12 days ago • 8
TIL: How a Harmless Refactor Exposed a Hidden CUDA Bug in Vision-Language Models By albertvillanova • 12 days ago
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard By nvidia and 4 others • 13 days ago • 13
🔄 Transfer Learning — Quand l'IA apprend de l'expérience comme toi ! 🎓🚀 By RDTvlokip • 13 days ago • 2
Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models By nvidia and 3 others • 14 days ago • 17
Art of Focus: Page-Aware Sparse Attention and Ling 2.0’s Quest for Efficient Context Length Scaling By RichardBian and 19 others • 14 days ago • 14
Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text By isaacchung and 2 others • 14 days ago • 33
GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms By otellm and 15 others • 14 days ago • 16