Efficient Training on Multiple Consumer GPUs with RoundPipe • Paper • 2604.27085 • Published 6 days ago • 35
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference • Paper • 2504.05897 • Published Apr 8, 2025 • 21