d-matrix/gpt2-medium
bmah-dmx committed 1 day ago

Commit b641742 (verified) · Parent: 8c1582a

Upload README.md with huggingface_hub

Files changed (1): README.md

@@ -18,7 +18,7 @@ The reference provides the following functional *configurations*:
   Configuration | Explanation
   :-- | :--
   **`BASELINE`** | a reference functionally equivalent to the original model
-  **`BASIC`** | all linear algebraic operands quantized to `MXINT8-64`, and all other operations transformed to approximated kernel simulations
+  **`BASIC`** | all linear algebraic operands quantized to `MXINT8-64`
 ### Usage
@@ -39,7 +39,9 @@ pip install -e .
 ```python
 from dmx.compressor.modeling import DmxModel
 import lm_eval
+from lm_eval.models.huggingface import HFLM
+lm_eval.api.registry.register_model("hf", HFLM)
 model_args = "pretrained=d-matrix/gpt2-medium,trust_remote_code=True"
 lm = lm_eval.api.registry.get_model("hf").create_from_arg_string(model_args, {"batch_size": 1})