Miscellaneous - a GayatriValley Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

GayatriValley 's Collections

Miscellaneous

updated Dec 13, 2024

Build error

Featured

791

Unique3D

⚡

791

Create a 1M faces 3D colored model from an image!
Runtime error

53

Paligemma Doc

📚

53

Try PaliGemma on document understanding tasks
wangfuyun/PCM_Weights

Text-to-Image • Updated Oct 30, 2024 • 118 • 98
Running on Zero

450

Stable Audio Open Zero

🔥

450

Generate audio from text prompts
Paused

Featured

314

PaliGemma Demo

🤲

314

Annotate and describe images with text prompts
Runtime error

42

T2V Turbo

🖼

42

Fastest high-quality video diffusion model.
atcsecure/dolphin-2.9.2-qwen72b-8.0bpw-h8-exl2

Text Generation • Updated Jun 9, 2024 • 5 • 2
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 222k • 3.21k
DAMO-NLP-SG/VideoLLaMA2-7B

Visual Question Answering • 8B • Updated Aug 13, 2024 • 309 • 42
SakanaAI/DiscoPOP-zephyr-7b-gemma

Text Generation • 9B • Updated Jun 13, 2024 • 33 • 36
madebyollin/taesd3

Updated Jun 14, 2024 • 360 • 38
hpcai-tech/OpenSora-VAE-v1.2

0.4B • Updated Jun 17, 2024 • 5.56k • 57
Running

Featured

84

NaRCan

💊

84

Transform videos into art using prompts
MaziyarPanahi/calme-2.1-qwen2-72b-GGUF

Text Generation • 73B • Updated Aug 2, 2024 • 10.9k • 13
Build error

Featured

93

DiffIR2VR

👌

93

Video upscaler/restorer
CAMB-AI/MARS5-TTS

Text-to-Speech • Updated Jul 5, 2024 • 90 • 481
dphn/dolphin-vision-72b

Text Generation • 73B • Updated Jul 16, 2024 • 247 • 132
Running on Zero

Featured

72

Florence-2 for Videos

🎬

72

Annotate and summarize video content
Running on Zero

132

FLUX.1-dev + Captioner

🐨

132

Generate images from prompts or images
Runtime error

Featured

367

Video Transcription Smart Summary

⚡

367

Generate summaries from YouTube videos or uploaded videos
qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • 1B • Updated Sep 21, 2024 • 126 • 110
Runtime error

Featured

124

nanoLLaVA-1.5

🚀

124

Chat about images by uploading them
zai-org/codegeex4-all-9b

Text Generation • 9B • Updated Jul 18, 2024 • 2.64k • 262
Sleeping

10

Langflow Crewai

💻

10

Build and run language models visually
Running on Zero

Featured

941

Tile Upscaler

🚀

941

Enhance and upscale images with advanced controls
Running

Featured

217

Whisper Timestamped

🕒

217

In-browser speech recognition w/ word-level timestamps
Runtime error

Featured

2.04k

IDM VTON

👕

2.04k

High-fidelity Virtual Try-on
deepseek-ai/DeepSeek-V2-Chat-0628

Text Generation • 236B • Updated Jul 18, 2024 • 447 • 177
TheDrummer/Big-Tiger-Gemma-27B-v1-GGUF

27B • Updated Jul 14, 2024 • 1k • 73
fal/AuraFlow

Text-to-Image • Updated Jul 18, 2024 • 413 • • 652
xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 120k • 1.64k
TheBloke/MythoMax-L2-13B-GPTQ

Text Generation • 13B • Updated Sep 27, 2023 • 496 • 215
Gryphe/MythoMax-L2-13b

Text Generation • Updated Apr 21, 2024 • 3.28k • • 365
Gryphe/Pantheon-RP-1.0-8b-Llama-3

Text Generation • 8B • Updated May 13, 2024 • 32 • • 51
Gryphe/Tiamat-8b-1.2-Llama-3-DPO

Text Generation • 8B • Updated May 3, 2024 • 14 • 6
BeaverLegacy/Smegmma-9B-v1

Text Generation • 10B • Updated Jul 13, 2024 • 25 • 50
mradermacher/Nymph_8B-i1-GGUF

8B • Updated Aug 2, 2024 • 347 • 2
Runtime error

29

MusiConGen

🪩

29
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated

Text Generation • 8B • Updated Sep 14, 2024 • 1.78k • • 187
FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 914 • 336
Running on Zero

MCP

24

Video-to-Audio Ldm

🎧

24

Video-to-Audio Generation with Hidden Alignment
CofeAI/Tele-FLM-1T

Text Generation • Updated Jul 29, 2024 • 382 • 82
maxin-cn/Cinemo

Image-to-Video • Updated Aug 14, 2024 • 34 • 32
Running on Zero

Featured

204

Cinemo

🎥

204

Multimodal Image-to-Video
Running

20

Mms Zeroshot

🌍

20

Transcribe audio in any language using text data
Running on Zero

Featured

56

AccDiffusion

🏆

56

Generate images from text prompts
Runtime error

Featured

185

Artist

🎨

185

Aesthetically Controllable Text-Driven Stylization w/o Train
Runtime error

95

EchoMimic

🐨

95

Generate lifelike video animations from images and audio
HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 96.7k • 301
parler-tts/parler-tts-mini-v1

Text-to-Speech • 0.9B • Updated Nov 25, 2024 • 10.7k • 152
parler-tts/parler-tts-large-v1

Text-to-Speech • 2B • Updated Nov 22, 2024 • 18.2k • 266
Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • 8B • Updated Nov 20, 2024 • 34.9k • 155
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 741k • • 12.1k
Runtime error

214

CatVTON

🐈

214

Try on clothes virtually with images
wanglab/ecg-fm

Updated May 5 • 14
XLabs-AI/flux-lora-collection

Text-to-Image • Updated Aug 14, 2024 • 580
Runtime error

58

Vgg Heads

🖼

58
migtissera/Tess-3-Mistral-Nemo-12B

12B • Updated Sep 4, 2024 • 24 • 13
nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 177 • 105
DAMO-NLP-SG/VideoLLaMA2-72B

Visual Question Answering • 75B • Updated Aug 14, 2024 • 23 • 10
answerdotai/answerai-colbert-small-v1

33.4M • Updated Nov 18, 2024 • 1M • 156
mlabonne/Hermes-3-Llama-3.1-8B-lorablated-GGUF

8B • Updated Aug 16, 2024 • 1.61k • 29
labotollama3/lobotollama-5.5b

Text Generation • 6B • Updated Apr 22, 2024 • 8 • 4
Mozilla/whisperfile

Updated Oct 2, 2024 • 1.43k • 255
Running

45

FAI Fuzer Medium v0.3

🎨

45

Generate enhanced images by blending foreground with custom backgrounds
ZhengPeng7/BiRefNet

Image Segmentation • 0.2B • Updated Sep 28 • 653k • 500
Running on CPU Upgrade

9.94k

Kolors Virtual Try-On

👕

9.94k

Try on clothes on a person image
fal/AuraFace-v1

Updated Aug 26, 2024 • 139
dphn/dolphin-2.9.4-gemma2-2b

3B • Updated Aug 27, 2024 • 45 • 38
pzc163/MiniCPMv2_6-prompt-generator

Updated Aug 24, 2024 • 96 • 49
Running on Zero

1.02k

CogVideoX-5B

🎥

1.02k

Text-to-Video
yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • 4B • Updated Sep 6, 2024 • 42 • 129
InstantX/FLUX.1-dev-Controlnet-Union

Updated Aug 26, 2024 • 10.4k • 468
Running on Zero

Featured

86

Qwen2-VL-2B

🔥

86

Generate text from images or videos
Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 1.64M • 477
Running

Featured

59

Groq Gradio Voice Assistant

👁

59

Transcribe speech and generate AI response
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 63 • 29
facebook/sapiens

Updated Sep 20, 2024 • 132 • 243
Running on Zero

28

Tb Ocr

📈

28

Convert image text to markdown format
YuWangX/memoryllm-8b-chat

10B • Updated Nov 17, 2024 • 813 • 20
Running

210

HivisionIDPhotos

🌖

210

Remove backgrounds from ID photos
virtuals-protocol/mario-videogamegen

Updated Sep 6, 2024 • 13
Running on Zero

266

Qwen2-VL-7B

🔥

266

Generate text from an image and question
Running on Zero

Featured

278

Latent Navigation

🪐

278

Travel through the model latent space
mattshumer/Reflection-Llama-3.1-70B

Text Generation • 71B • Updated Sep 24, 2024 • 433 • 1.71k
Configuration error

Featured

116

ViewCrafter

🐨

116

Create a video from an image with camera motion
Runtime error

18

Text Image Analyzer

💻

18

Analyse any image with Llama3.2
vidore/colqwen2-v0.1

Visual Document Retrieval • Updated Mar 21 • 80.4k • 193
Runtime error

12

Llama 3.2 Vision Free

🐢

12
facebook/Self-taught-evaluator-llama3.1-70B

Updated Sep 30, 2024 • 42
openai/clip-vit-large-patch14-336

Zero-Shot Image Classification • Updated Oct 4, 2022 • 4.45M • 282
jasperai/Flux.1-dev-Controlnet-Upscaler

Image-to-Image • Updated Mar 22 • 5.16k • 853
Running on Zero

Featured

326

Diffusers Image Fill

🏃

326

Fill and edit images using masks
Running

36

PDF to Page Images Dataset

📂

36

Convert PDFs to individual page images
Running on Zero

Featured

72

ColPali fine-tuning Query Generator

🔍

72

Generate document retrieval queries from an image
Running on Zero

10

Vision Pipeline

🌍

10

Generate answers from images and text queries
nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14 • 103k • 775
Running on Zero

994

Whisper Turbo

🤯

994

Transcribe audio or YouTube videos into text
davanstrien/ufo-ColPali

Viewer • Updated Sep 23, 2024 • 2.24k • 129 • 25
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 194 • 72
Build error

214

OpenMusic

🎶

214

Generate music from text descriptions
Running

452

PDF2Audio

📚

452

Transform text into engaging podcast dialogues or detailed reports
Running on Zero

239

Ultrapixel-demo

😻

239

Ultra-high resolution image synthesis
PleIAs/OCRonos-Vintage

Text Generation • 0.1B • Updated Aug 8, 2024 • 138 • 81
Running on Zero

275

EzAudio

🟣

275

Generate and edit audio from text prompts
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 15.3k • 1.53k
Running on CPU Upgrade

954

Open VLM Leaderboard

🌎

954

VLMEvalKit Evaluation Results Collection
Build error

64

ArxivCopilot

🏢

64

Generate personalized research profiles and chat with Arxiv Copilot
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4, 2024 • 4 • 437
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 50 • 382
ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 136 • 417
fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 320 • 454
bartowski/Reflection-Llama-3.1-70B-GGUF

Text Generation • 71B • Updated Sep 7, 2024 • 1.7k • 53
lelapa/InkubaLM-0.4B

Text Generation • Updated Sep 5, 2024 • 208 • 57
Running

143

Qwen 2.5 Code Interpreter

🐍

143

Execute code snippets and get results
Runtime error

311

Virtual Try On

👕

311

High-fidelity Virtual Try-on
Runtime error

36

Ferret Demo

📚

36

Describe image contents with prompts
Running on L4

61

ColPali 🤝 Vespa - Visual Retrieval

👀

61

Visual Retrieval with ColPali and Vespa
oxyapi/oxy-1-small

Text Generation • 15B • Updated Apr 30 • 958 • • 83
QuantFactory/MN-Chunky-Lotus-12B-GGUF

12B • Updated Dec 4, 2024 • 374 • 4
Running

25

ScholarCopilot

📊

25

Using RAG LLM to assist your academic writing
Running on Zero

603

Leffa

👗

603

Generate person images with new clothes or poses
Lightricks/LTX-Video

Image-to-Video • Updated Jul 16 • 182k • • 2.08k

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs