Liger Kernel: Efficient Triton Kernels for LLM Training Paper • 2410.10989 • Published Oct 14, 2024 • 1
LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding Paper • 2508.01617 • Published Aug 3
Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction Paper • 2509.12464 • Published Sep 15
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs Paper • 2509.25779 • Published 22 days ago • 16