MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 356
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? +5 May 11 • 86
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 5 days ago • 60
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 222
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published Jan 14 • 61
BhasaAnuvaad Collection A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated Jan 16 • 22