llmware
/

qwen2.5-3b-instruct-ov

Model card Files Files and versions Community

qwen2.5-3b-instruct-ov / README.md

lbourdois's picture

Improve language tag

7144d69 verified 4 months ago

|

1.15 kB

	---
	license: apache-2.0
	inference: false
	base_model: Qwen/Qwen2.5-3B-Instruct
	base_model_relation: quantized
	tags:
	- green
	- llmware-chat
	- p3
	- ov
	- emerald
	language:
	- zho
	- eng
	- fra
	- spa
	- por
	- deu
	- ita
	- rus
	- jpn
	- kor
	- vie
	- tha
	- ara
	---

	# qwen2.5-3b-instruct-ov

	qwen2.5-3b-instruct-ov is an OpenVino int4 quantized version of [Qwen2.5-3B-Instruct](https://www.huggingface.co/Qwen/Qwen2.5-3B-Instruct), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.

	This is from the latest release series from Qwen.

	### Model Description

	- Developed by: Qwen
	- Quantized by: llmware
	- Model type: qwen2.5
	- Parameters: 3 billion
	- Model Parent: Qwen/Qwen2.5-3B-Instruct
	- Language(s) (NLP): English
	- License: Apache 2.0
	- Uses: Chat, general-purpose LLM
	- Quantization: int4


	## Model Card Contact

	[llmware on github](https://www.github.com/llmware-ai/llmware)

	[llmware on hf](https://www.huggingface.co/llmware)

	[llmware website](https://www.llmware.ai)