galois77's Collections

THE ORB

updated 2 days ago

  • UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

    Paper • 2511.08521 • Published 5 days ago • 34

  • Black-Box On-Policy Distillation of Large Language Models

    Paper • 2511.10643 • Published 3 days ago • 35

  • Depth Anything 3: Recovering the Visual Space from Any Views

    Paper • 2511.10647 • Published 3 days ago • 35

  • VGGT: Visual Geometry Grounded Transformer

    Paper • 2503.11651 • Published Mar 14 • 33

  • Music Flamingo: Scaling Music Understanding in Audio Language Models

    Paper • 2511.10289 • Published 3 days ago • 8