THUDM/GLM-4-32B-0414

#844
by x0wllaar - opened

New model (https://huggingface.co/THUDM/GLM-4-32B-0414); they also have a bunch of other thinking and non-thinking models: https://huggingface.co/collections/THUDM/glm-4-0414-67f3cbcb34dd9d252707cb2e

Should already be supported by llama.cpp.

I haven't checked them all, but unfortunately I don't think any of them are supported yet :(

mradermacher changed discussion status to closed

llama.cpp can run them; there are already other quants on here. Here's the merged PR: https://github.com/ggml-org/llama.cpp/pull/12867.
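
For reference, running one of those existing GGUF quants only needs the stock CLI. A minimal sketch (assuming a built llama.cpp and a downloaded quant; the filename is just a placeholder):

```sh
# Load a downloaded GGUF quant and generate a short completion
./llama-cli -m GLM-4-32B-0414-Q4_K_M.gguf -p "Hello, world" -n 128
```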

The problem is that quantizing currently produces silent corruption; see https://github.com/ggml-org/llama.cpp/pull/12957 for the fix.
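
For context, the corruption shows up in the usual GGUF conversion/quantization workflow, roughly like this (a sketch assuming a local Hugging Face checkout of the model and a built llama.cpp; paths and quant type are illustrative):

```sh
# Convert the Hugging Face checkpoint to a full-precision GGUF file
python convert_hf_to_gguf.py ./GLM-4-32B-0414 --outtype f16 --outfile glm-4-32b-0414-f16.gguf

# Quantize the GGUF -- this is the step affected by the silent corruption
# until the fix from PR 12957 is in your build
./llama-quantize glm-4-32b-0414-f16.gguf glm-4-32b-0414-Q4_K_M.gguf Q4_K_M
```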

Just drop us a note once llama.cpp has support for these models, and we will quantize them.
