view post Post 3922 simple guide on the recipe for GRPO on Open-R1 which is built on top of TRL I think FastAPI wrapper of vLLM with WeightSyncWorker is pretty cool feature. Also, we have many predefined reward functions out of the box! See translation 5 replies · ❤️ 17 17 + Reply
view article Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 335
view article Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One By rishiraj • Jun 26 • 38
view article Article State of open video generation models in Diffusers By sayakpaul and 2 others • Jan 27 • 59
view article Article How Long Prompts Block Other Requests - Optimizing LLM Performance By tngtech • Jun 12 • 5
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance By tngtech • Apr 16 • 30