Hugging Face
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
Transformers
Safetensors
qwen3_moe
Qwen3
GPTQ
Int4-Int8Mix
Quantization fix
vLLM
conversational
4-bit precision
gptq
arXiv: 2505.09388
License: apache-2.0
Discussions
Error on loading in vLLM, what am I doing wrong?
#1 opened 3 months ago by djdeniro · 7 comments