blanchefort's Collections

Audio

updated 12 days ago

  • nvidia/audio-flamingo-3-hf

    Audio-Text-to-Text • 8B • Updated 13 days ago • 98k • 171

  • facebook/sam-audio-large

    Updated Dec 30, 2025 • 12.6k • 366

  • google/medasr

    Automatic Speech Recognition • Updated 14 days ago • 22.4k • 276

  • FunAudioLLM/Fun-CosyVoice3-0.5B-2512

    Text-to-Speech • Updated 7 days ago • 4.61k • 446

  • facebook/sam-audio-large-tv

    Updated Dec 30, 2025 • 1.18k • 24

  • Qwen/Qwen3-TTS-12Hz-0.6B-Base

    Text-to-Speech • Updated 12 days ago • 174k • 163