Qwen3 235B Instruct 2507 gguf

Make sure you have enough RAM/GPU memory to run the model. The size of each quantized variant is listed on the right-hand side of the model card.
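As a rough back-of-envelope check, the weight file size (and the minimum memory needed just to hold the weights, before KV cache and runtime overhead) is roughly the parameter count times the bits per weight. The bits-per-weight values below are assumed averages for illustration, not exact figures; actual GGUF sizes differ because K-quants mix precisions across tensors.

```python
# Rough size estimate for quantized GGUF variants of a 235B-parameter model.
# Bits-per-weight values are assumptions, not measured; treat the output as
# an order-of-magnitude guide only.

PARAMS = 235e9  # total parameter count of Qwen3-235B-A22B

BITS_PER_WEIGHT = {  # assumed effective bits per weight
    "Q2_K": 2.6,
    "Q3_K_M": 3.9,
    "Q4_K_M": 4.8,
    "Q6_K": 6.6,
    "Q8_0": 8.5,
}

for quant, bits in BITS_PER_WEIGHT.items():
    gib = PARAMS * bits / 8 / 2**30
    print(f"{quant:7s} ~{gib:5.0f} GiB")
```

For example, Q4_K_M comes out around 130 GiB by this estimate, so plan your RAM/VRAM (or combined CPU+GPU offload) accordingly.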

Use the model in LM Studio

Download and install LM Studio:

https://lmstudio.ai/

Discover models

In LM Studio, click the "Discover" icon. The "Mission Control" popup window will open.

In the "Mission Control" search bar, type "John1604/Qwen3-235B-A22B-Instruct-2507-gguf" and check "GGUF", the model should be found.

Download the model.

You can choose the quantization type in the download window.
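If you prefer scripting the download instead of using the Mission Control window, the same files can be fetched with the huggingface_hub Python package. The filename pattern below is an assumption; check the repo's file list for the exact names of the quantization you want (large quants are split across several GGUF parts). Note that a manually downloaded file may need to be imported into LM Studio's models directory before it shows up in the app.

```python
from huggingface_hub import snapshot_download

# Download only the Q4_K_M shards from the repo. The pattern is an
# assumption; match it to the filenames listed under "Files and versions".
local_dir = snapshot_download(
    repo_id="John1604/Qwen3-235B-A22B-Instruct-2507-gguf",
    allow_patterns=["*Q4_K_M*"],
)
print("Downloaded to:", local_dir)
```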

Load the model.

Load the model into LM Studio.

Ask questions.
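Besides chatting in the LM Studio UI, LM Studio can expose the loaded model through a local OpenAI-compatible server (enable it from the Developer tab). A minimal request sketch, assuming the server runs on the default http://localhost:1234 and using a hypothetical model identifier; copy the exact values LM Studio shows:

```python
import requests

# Chat completion request against LM Studio's OpenAI-compatible local server.
# Port and model name are assumptions; use the values shown in LM Studio.
resp = requests.post(
    "http://localhost:1234/v1/chat/completions",
    json={
        "model": "qwen3-235b-a22b-instruct-2507",  # hypothetical identifier
        "messages": [
            {"role": "user", "content": "Give a one-sentence summary of GGUF."}
        ],
        "temperature": 0.7,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```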

Quantized models

| Type   | Bits  | Quality            | Description                           |
|--------|-------|--------------------|---------------------------------------|
| Q2_K   | 2-bit | 🟥 Low             | Minimal footprint; only for tests     |
| Q3_K_S | 3-bit | 🟧 Low             | “Small” variant (less accurate)       |
| Q3_K_M | 3-bit | 🟧 Low–Med         | “Medium” variant                      |
| Q4_K_S | 4-bit | 🟨 Med             | Small, faster, slightly less quality  |
| Q4_K_M | 4-bit | 🟩 Med–High        | “Medium” variant; best 4-bit balance  |
| Q6_K   | 6-bit | 🟩🟩🟩 Very High   | Almost FP16 quality, larger size      |
| Q8_0   | 8-bit | 🟩🟩🟩🟩           | Near-lossless baseline                |
Model size: 235B params (architecture: qwen3moe)