RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a16 • Text Generation • 19B • 873 • 5
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8 • Text Generation • 8B • 93.6k • 42
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w8a8 • Text Generation • 7B • 94 • 2
RedHatAI/Qwen2-72B-Instruct-quantized.w8a8 • Text Generation • 73B • 4 • 1
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w8a8 • Text Generation • 71B • 6
RedHatAI/Qwen2-7B-Instruct-quantized.w8a8 • Text Generation • 8B • 20
RedHatAI/Phi-3-medium-128k-instruct-quantized.w4a16 • Text Generation • 2B • 13.7k • 3
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a8 • Text Generation • 0.6B • 344
RedHatAI/Phi-3-mini-128k-instruct-quantized.w4a16 • Text Generation • 0.7B • 40 • 1
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a8 • Text Generation • 2B • 1.18k
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a8 • Text Generation • 4B • 22
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a8 • Text Generation • 8B • 4.28k • 2
RedHatAI/Llama-2-7b-chat-quantized.w8a8 • Text Generation • 7B • 2.62k • 1
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a16 • Text Generation • 1B • 11
RedHatAI/Phi-3-mini-128k-instruct-FP8 • Text Generation • 4B • 8
RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic • Text Generation • 4B • 1.18k • 3
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic • Text Generation • 1B • 37.8k • 3
RedHatAI/gemma-2-9b-it-quantized.w8a8 • Text Generation • 10B • 12 • 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8 • Text Generation • 14B • 5 • 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a16 • Text Generation • 4B • 4 • 2
RedHatAI/Phi-3-medium-128k-instruct-FP8 • Text Generation • 14B • 2.87k • 5
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a16 • 9B • 4
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a16 • 3B • 5
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a16 • 0.4B • 4
RedHatAI/Qwen2.5-72B-Instruct-quantized.w8a8 • 73B • 23
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a8 • 33B • 117
RedHatAI/Llama-3.2-1B-FP8 • 1B • 7
RedHatAI/Qwen2.5-32B-quantized.w8a8 • 33B • 4
RedHatAI/Meta-Llama-3.1-405B-Instruct-FP8 • Text Generation • 406B • 2.18k • 31
RedHatAI/Qwen2.5-3B-Instruct-quantized.w8a8 • 3B • 5