https://huggingface.co/collections/qingy2024/qwen-3-vlto

#1482
by qingy2024 - opened

Would love some GGUFs of the 32B Thinking/Instruct. It's basically Qwen3 VL 32B Instruct/Thinking but without the vision part. So essentially it's an upgraded Qwen3 32B that works with llama.cpp.

They are all queued! :D
Thanks a lot for creating them. Qwen3 VL not yet being supported by llama.cpp is such a shame. I wasn't aware that you can simply remove the vision part from them. That's so cool. Great work!

You can check for progress at http://hf.tst.eu/status.html or regularly check the model summary pages for quants to appear under:

Awesome, thanks for queueing them!!
I just checked and apparently the Qwen3 VL PR was approved just 2 hours ago, so it looks like support is coming soon anyway, but the new 32B's and 8B's have some nice text-only performance improvements so I think it's pretty nice for people who want to fine-tune/run without having to deal with the more finicky aspect of VL models :)

qingy2024 changed discussion status to closed

Sign up or log in to comment