DeepSeek-R1-ThaiInsurance-COT-Demo1 - GGUF

About

For a convenient overview and download list, visit our model page.

If you are unsure how to use GGUF files, refer to the llama.cpp documentation for more details. A minimal run with llama-cli looks like this:

./llama-cli -m DeepSeek-R1-ThaiInsurance-COT-Demo1-q4_k_m.gguf -p "Hello!"

Available quantizations (sorted by file size, not necessarily by quality):

Link	Type	Size/GB	Notes
GGUF	q2_k	2.96	very low quality, for testing
GGUF	q3_k_m	3.74
GGUF	q4_0	4.34
GGUF	q4_k_m	4.58	recommended, good balance
GGUF	q5_k_m	5.34
GGUF	q8_0	7.95	near-full precision
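
As a rough aid for choosing a file from the table above, the following minimal sketch picks the largest quant that fits a given memory budget. The sizes are copied from the table. The function name `pick_quant` and the default 1.5 GB of headroom (for context and runtime overhead) are assumptions made for illustration, not measured values. Since the table is sorted by size rather than quality, treat the result as a starting point only.

```python
# File sizes in GB, copied from the table above.
QUANT_SIZES_GB = {
    "q2_k": 2.96,
    "q3_k_m": 3.74,
    "q4_0": 4.34,
    "q4_k_m": 4.58,
    "q5_k_m": 5.34,
    "q8_0": 7.95,
}


def pick_quant(available_gb, headroom_gb=1.5):
    """Return the largest quant whose file fits in the budget, or None.

    headroom_gb is an assumed allowance for context and runtime
    overhead; adjust it for your own setup.
    """
    budget = available_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= budget]
    if not fitting:
        return None
    # max() compares by size first, so this yields the largest fitting file.
    return max(fitting)[1]


if __name__ == "__main__":
    for mem_gb in (4, 6, 8, 16):
        print(mem_gb, "GB ->", pick_quant(mem_gb))
```

The returned name can be substituted into the file name passed to `-m` in the command shown earlier.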

Special thanks to the llama.cpp team for their amazing work.

Model size: 8B params

Architecture: llama
