Projects I've worked on (or contributed to)
mrfakename PRO
mrfakename
AI & ML interests
LLMs, TTS, & Open Source
Recent Activity
updated
a dataset
29 minutes ago
emoact/pretokenized
updated
a model
about 19 hours ago
mrfakename/merged-model-with-unfrozen-layers
published
a model
about 19 hours ago
mrfakename/merged-model-with-unfrozen-layers
Organizations
OpenF5 TTS
The OpenF5 TTS model series (currently OpenF5 TTS Base - more variants coming soon 👀)
Zero-Shot Voice Cloning
TTS models that support zero-shot voice cloning
-
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Paper • 2502.18924 • Published • 16 -
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Paper • 2409.00750 • Published • 5 -
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper • 2410.06885 • Published • 46 -
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
Paper • 2409.10058 • Published • 2
Spaces of the Week
My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom 🤗
-
Running on L4Featured709
StyleTTS 2
🗣709Efficient, fast, and natural text to speech with StyleTTS 2!
-
Running on ZeroFeatured415
OpenDalle V1.1 GPU Demo
🖼415A demo of OpenDalle V1.1 on a ZERO GPU.
-
Runtime errorFeatured74
RWKV Music
🎵74Generate MIDI music using RWKV v4!
-
Running on CPU UpgradeFeatured902
TTS Arena V2
🏆902Vote on the latest TTS models!
Podcast Pile
EmoAct
Llamafied Models
Models converted to the Llama format
-
mrfakename/Apriel-5B-Instruct-llamafied
Text Generation • 5B • Updated • 4 • 4 -
mrfakename/Apriel-5B-Base-llamafied
Text Generation • 5B • Updated • 2 -
llamafy/Qwen-Qwen2.5-1.5B-llamafied
Text Generation • 2B • Updated • 1 -
llamafy/Qwen-Qwen2.5-1.5B-Instruct-llamafied
Text Generation • 2B • Updated • 10
Failed Experiments
Experiments that didn't work out.
Projects
Projects I've worked on (or contributed to)
Podcast Pile
OpenF5 TTS
The OpenF5 TTS model series (currently OpenF5 TTS Base - more variants coming soon 👀)
EmoAct
Zero-Shot Voice Cloning
TTS models that support zero-shot voice cloning
-
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Paper • 2502.18924 • Published • 16 -
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Paper • 2409.00750 • Published • 5 -
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper • 2410.06885 • Published • 46 -
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
Paper • 2409.10058 • Published • 2
Llamafied Models
Models converted to the Llama format
-
mrfakename/Apriel-5B-Instruct-llamafied
Text Generation • 5B • Updated • 4 • 4 -
mrfakename/Apriel-5B-Base-llamafied
Text Generation • 5B • Updated • 2 -
llamafy/Qwen-Qwen2.5-1.5B-llamafied
Text Generation • 2B • Updated • 1 -
llamafy/Qwen-Qwen2.5-1.5B-Instruct-llamafied
Text Generation • 2B • Updated • 10
Spaces of the Week
My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom 🤗
-
Running on L4Featured709
StyleTTS 2
🗣709Efficient, fast, and natural text to speech with StyleTTS 2!
-
Running on ZeroFeatured415
OpenDalle V1.1 GPU Demo
🖼415A demo of OpenDalle V1.1 on a ZERO GPU.
-
Runtime errorFeatured74
RWKV Music
🎵74Generate MIDI music using RWKV v4!
-
Running on CPU UpgradeFeatured902
TTS Arena V2
🏆902Vote on the latest TTS models!
Failed Experiments
Experiments that didn't work out.