Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Puyuan Peng's picture
9 1

Puyuan Peng

pyp1
isasah's profile picture Hughwang's profile picture Kenano's profile picture
·
https://jasonppy.github.io/
  • PuyuanPeng
  • jasonppy

AI & ML interests

Speech/Audio Generation, Speech Processing, Multimodal Learning

Organizations

ZeroGPU Explorers's profile picture TTS AGI's profile picture

Papers 1

arxiv:2505.13444

spaces 1

Build error
161

VoiceCraft

📈

May 6, 2024

models 9

pyp1/Encodec_VoiceStar

Updated Apr 8 • 1

pyp1/VoiceStar

Updated Apr 6 • 2

pyp1/VoiceCraft

Text-to-Speech • 0.3B • Updated Aug 21, 2024 • 13 • 212

pyp1/VoiceCraft_615M_8cb1024_se

Text-to-Speech • 0.6B • Updated Aug 21, 2024 • 2

pyp1/VoiceCraft_330M_TTSEnhanced

Text-to-Speech • 0.3B • Updated Apr 25, 2024 • 28 • 1

pyp1/VoiceCraft_830M_TTSEnhanced

Text-to-Speech • 0.8B • Updated Apr 21, 2024 • 164 • 8

pyp1/VoiceCraft_giga330M

Text-to-Speech • 0.3B • Updated Apr 16, 2024 • 127

pyp1/VoiceCraft_giga830M

Text-to-Speech • 0.8B • Updated Apr 16, 2024 • 196 • 1

pyp1/VoiceCraft_gigaHalfLibri330M_TTSEnhanced_max16s

Text-to-Speech • 0.3B • Updated Apr 16, 2024 • 28 • 1

datasets 1

pyp1/VoiceCraft_RealEdit

Viewer • Updated Mar 25, 2024 • 7.89M • 20 • 5
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs