Josh PRO

ACloudCenter
AI & ML interests

Real-Time AI applications, ASR, TTS, STT, and streaming media models

Recent Activity

replied to their post 4 days ago
I've really been into testing various ASR, TTS, and other audio-related models. This Space showcases the NVIDIA Canary-Qwen 2.5B model. The model transcribes incredibly fast and uses Qwen to answer queries about the transcript. All audio example files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model!! https://huggingface.co/spaces/ACloudCenter/canary-qwen-transcriber-2.5b
new activity 6 days ago
broadfield-dev/VibeVoice-demo-dev: ZeroGPU Timeout feedback
new activity 11 days ago
microsoft/VibeVoice-1.5B: The GitHub repo is deleted
Organizations

A Cloud Center

ACloudCenter's Spaces (2)

Running on Zero
3

Canary Qwen Transcriber 2.5b

Transcribe audio and ask questions about the transcript

11 days ago
Running on Zero
1

ACE Step

😻

A Step Towards Music Generation Foundation Model

14 days ago