Update README.md
README.md CHANGED

@@ -26,12 +26,6 @@ Our GPTQ-based quantization methods achieve **superior quality-compression trade
 - **Error-correcting updates** during calibration for improved accuracy
 - **Optimized configurations** that allocate bits based on layer sensitivity (EvoPress)
 
-| Method | Avg Bits | C4 PPL | WikiText2 PPL |
-|--------|----------|--------|---------------|
-| GPTQ-4 | 4.50 | 11.35 | 6.89 |
-| EvoPress-GPTQ-4 | 4.50 | 11.35 | 6.89 |
-| EvoPress-GPTQ-5 | 5.51 | 11.13 | 6.79 |
-
 ## Usage
 
 Compatible with llama.cpp and all GGUF-supporting inference engines. No special setup required.
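The "optimized configurations" bullet in the diff above describes allocating bit-widths according to per-layer sensitivity. As a rough illustration of that idea only (this is not the EvoPress algorithm, and the function name, scores, and budget below are all hypothetical), a greedy allocator might spend a fixed average-bits budget on the most sensitive layers first:

```python
# Illustrative sketch: give extra bits to the most sensitive layers while
# keeping the average bit-width at or below a target budget.
# Hypothetical helper -- not part of EvoPress or llama.cpp.

def allocate_bits(sensitivities, base_bits=4, max_bits=8, avg_budget=4.5):
    """Assign a bit-width to each layer, favoring sensitive layers."""
    n = len(sensitivities)
    bits = [base_bits] * n
    # Whole extra bits we may hand out without exceeding the average budget.
    extra = int((avg_budget - base_bits) * n)
    # Visit layers from most to least sensitive.
    for i in sorted(range(n), key=lambda j: sensitivities[j], reverse=True):
        if extra == 0:
            break
        step = min(max_bits - bits[i], extra)  # cap at max_bits per layer
        bits[i] += step
        extra -= step
    return bits

# Four layers, the last one most sensitive: it absorbs the whole budget.
print(allocate_bits([0.1, 0.2, 0.3, 0.9]))  # -> [4, 4, 4, 6]
```

The resulting average here is (4+4+4+6)/4 = 4.5 bits, matching the kind of fractional average bit-widths reported in the removed table.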