Running 3.76k The Ultra-Scale Playbook π 3.76k The ultimate guide to training LLM on large GPU Clusters
AMDGPU onnx Collection optimized image generation ONNX models for AMD Ryzen (TM) AI GPUs and Radeon Discrete GPUs β’ 19 items β’ Updated Mar 2 β’ 11
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training Paper β’ 2505.11594 β’ Published May 16, 2025 β’ 75
Elucidating the Design Space of Diffusion-Based Generative Models Paper β’ 2206.00364 β’ Published Jun 1, 2022 β’ 18
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 32 items β’ Updated 22 days ago β’ 151