medmekk
/

quantized_int4

Model card Files Files and versions

quantized_int4 / README.md

medmekk's picture

medmekk HF Staff

Upload folder using huggingface_hub

2f9205d verified 11 months ago

|

history blame contribute delete

568 Bytes

	---
	base_model:
	- meta-llama/Meta-Llama-3-8B
	---

	# meta-llama/Meta-Llama-3-8B (Quantized)

	## Description
	This model is a quantized version of the original model `meta-llama/Meta-Llama-3-8B`. It has been quantized using int4_weight_only quantization with torchao.

	## Quantization Details
	- Quantization Type: int4_weight_only
	- Group Size: 128

	## Usage
	You can use this model in your applications by loading it directly from the Hugging Face Hub:

	```python
	from transformers import AutoModel

	model = AutoModel.from_pretrained("meta-llama/Meta-Llama-3-8B")