Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen3-Omni-30B-A3B-Instruct
like
733
Follow
Qwen
59.4k
Any-to-Any
Transformers
Safetensors
English
qwen3_omni_moe
text-to-audio
multimodal
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
29
Deploy
Use this model
VLLM 和 Sglang 部署 Qwen3-Omni,GPU 的利用率都不高,请问是什么原因? #114
#27
by
yixue
- opened
Oct 31
Discussion
yixue
Oct 31
VLLM 和 Sglang 部署 Qwen3-Omni,GPU 的利用率都不高,请问是什么原因?跑更大的文本大模型,都不会出现这样的情况。
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment