Update README.md
README.md CHANGED
@@ -12,14 +12,15 @@ base_model:
 
 Original Model : [THUDM/GLM-Z1-9B-0414](https://huggingface.co/THUDM/GLM-Z1-9B-0414)
 
-Llama.cpp build:
+Llama.cpp build: ced44be3 (5199)
 
 I used imatrix to create all these quants using this [Dataset](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
 
-
-Update-
-* [Fixed Quant] Re-quantized all quants
-
+---
+# Update-02
+* [Fixed Quant] Re-quantized all quants to fix this issue. [#13099](https://github.com/ggml-org/llama.cpp/pull/13099) [#13140](https://github.com/ggml-org/llama.cpp/pull/13140)
+
+---
 
 | | CPU (AVX2) | CPU (ARM NEON) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute |
 | :------------ | :---------: | :------------: | :---: | :----: | :-----: | :---: | :------: | :----: | :------: |