Puyuan Peng's picture

Puyuan Peng

pyp1

·

https://jasonppy.github.io/

AI & ML interests

Speech/Audio Generation, Speech Processing, Multimodal Learning

Organizations

Papers 1

arxiv:2505.13444

spaces 1

VoiceCraft

models 9

pyp1/Encodec_VoiceStar

Updated Apr 8, 2025 • 1

pyp1/VoiceStar

Updated Apr 6, 2025 • 2

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21, 2024 • 24 • 212

pyp1/VoiceCraft_615M_8cb1024_se

Text-to-Speech • Updated Aug 21, 2024 • 9

pyp1/VoiceCraft_330M_TTSEnhanced

Text-to-Speech • Updated Apr 25, 2024 • 10 • 1

pyp1/VoiceCraft_830M_TTSEnhanced

Text-to-Speech • Updated Apr 21, 2024 • 94 • 8

pyp1/VoiceCraft_giga330M

Text-to-Speech • Updated Apr 16, 2024 • 26

pyp1/VoiceCraft_giga830M

Text-to-Speech • Updated Apr 16, 2024 • 83 • 1

pyp1/VoiceCraft_gigaHalfLibri330M_TTSEnhanced_max16s

Text-to-Speech • Updated Apr 16, 2024 • 27 • 1

datasets 1

pyp1/VoiceCraft_RealEdit

Viewer • Updated Mar 25, 2024 • 7.89M • 39 • 5