alexmarques commited on
Commit
305328a
·
verified ·
1 Parent(s): 4df0b9c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -26,7 +26,8 @@ tags:
26
  - **Input:** Text
27
  - **Output:** Text
28
  - **Model Optimizations:**
29
- - **Weight quantization:** INT4
 
30
  - **Intended Use Cases:** This model is designed to accelerate research on language models, for use as a building block for generative AI powered features. It provides uses for general purpose AI systems and applications (primarily in English) which require:
31
  1. Memory/compute constrained environments.
32
  2. Latency bound scenarios.
 
26
  - **Input:** Text
27
  - **Output:** Text
28
  - **Model Optimizations:**
29
+ - **Activation quantization:** FP8
30
+ - **Weight quantization:** FP8
31
  - **Intended Use Cases:** This model is designed to accelerate research on language models, for use as a building block for generative AI powered features. It provides uses for general purpose AI systems and applications (primarily in English) which require:
32
  1. Memory/compute constrained environments.
33
  2. Latency bound scenarios.