Ai-Model
-
Image-Text-to-Text • 25B • Updated • 60.9k • 637 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • Updated • 4.79M • 2.86k -
SWivid/F5-TTS
Text-to-Speech • Updated • 857k • 1.15k -
D-Edit
84 -
FacePoke
2.21k • Import a portrait, click to move the head!
-
Expression Editor
1.63k • Quickly edit the expression of a face
-
F5-TTS
2.82k • F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
FLUX.1 [dev]
9.41k • Generate images from text prompts
-
Face Recognition SDK
234 • Face Recognition
-
Open NotebookLM
1.09k • Personalised Podcasts For All - Available in 13 Languages
-
PMRF
314 • A Gradio demo for Posterior-Mean Rectified Flow (PMRF)
-
stabilityai/stable-diffusion-3.5-large
Text-to-Image • Updated • 80.6k • 3.38k -
genmo/mochi-1-preview
Text-to-Video • Updated • 10.8k • 1.31k -
Freepik/flux.1-lite-8B-alpha
Text-to-Image • Updated • 381 • 427 -
rhymes-ai/Allegro
Text-to-Video • Updated • 186 • 264 -
CohereLabs/aya-expanse-8b
Text Generation • 8B • Updated • 15.9k • 422 -
deepseek-ai/Janus-1.3B
Any-to-Any • 2B • Updated • 4.3k • 593 -
Pangea
50 • A Fully Open Multilingual Multimodal LLM for 39 Languages
-
Etched/oasis-500m
Updated • 94 • 490 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 419 • 1.71k -
OuteAI/OuteTTS-0.1-350M
Text-to-Speech • Updated • 303 • 302 -
tencent/Tencent-Hunyuan-Large
Text Generation • Updated • 977 • 617 -
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation • 71B • Updated • 11.2k • 2.06k -
tencent/HunyuanVideo
Text-to-Video • Updated • 852 • 2.14k -
zai-org/CogVideoX-5b
Text-to-Video • Updated • 30.1k • 665 -
LanguageBind/Open-Sora-Plan-v1.2.0
Updated • 2 • 47 -
microsoft/phi-4
Text Generation • Updated • 983k • 2.22k -
TRELLIS
4.78k • Scalable and Versatile 3D Generation from images
-
Search Your Face Online
834 • Track your online presence with reverse face search
-
Kolors Virtual Try-On
10k • Try on clothes on a person image
-
DeepSeek-R1 WebGPU
554 • Next-generation reasoning model that runs locally in-browser
-
AnyCoder
3.17k • Generate code instantly from natural language prompts
-
tencent/Hunyuan3D-2
Image-to-3D • Updated • 92.4k • 1.72k -
openbmb/MiniCPM-o-2_6
Any-to-Any • 9B • Updated • 115k • 1.29k -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • Updated • 101k • 755 -
Magic Face
244 • Transform Your Face Into Legendary Characters!
-
Llasa 3b Tts
313 • Zero-shot voice cloning with Llasa 3B (Unofficial Demo)
-
mistralai/Mistral-Small-24B-Instruct-2501
Updated • 107k • 948 -
Pyramid Flow
670 • Generate videos from text prompts and optional images
-
microsoft/OmniParser-v2.0
Updated • 1.1k • 1.3k -
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech • Updated • 2.78k • 1.1k -
perplexity-ai/r1-1776
Text Generation • Updated • 553 • 2.32k -
agentica-org/DeepScaleR-1.5B-Preview
Text Generation • 2B • Updated • 13.5k • 574 -
stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text • 132B • Updated • 28 • 458 -
hexgrad/Kokoro-82M
Text-to-Speech • Updated • 9.09M • 5.82k -
black-forest-labs/FLUX.1-dev
Text-to-Image • Updated • 784k • 12.5k -
NousResearch/DeepHermes-3-Llama-3-8B-Preview
Text Generation • 8B • Updated • 131 • 351