RedHatAI/Qwen2.5-1.5B-Instruct-quantized.w8a8
2B
•
Updated
•
47
RedHatAI/SparseLlama-3-8B-pruned_50.2of4
Text Generation
•
8B
•
Updated
•
21
RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic
Text Generation
•
89B
•
Updated
•
2.92k
•
10
RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
Text Generation
•
11B
•
Updated
•
1.33k
•
24
RedHatAI/Phi-3.5-mini-instruct-FP8-KV
Text Generation
•
4B
•
Updated
•
4
•
2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16
Text Generation
•
11B
•
Updated
•
1.58k
•
2
RedHatAI/SmolLM-135M-q
Updated
RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8
Text Generation
•
141B
•
Updated
•
12
•
3
RedHatAI/DeepSeek-Coder-V2-Base-FP8
Text Generation
•
236B
•
Updated
•
9
RedHatAI/DeepSeek-Coder-V2-Instruct-FP8
Text Generation
•
236B
•
Updated
•
60
•
7
RedHatAI/Mistral-Nemo-Instruct-2407-FP8
Text Generation
•
12B
•
Updated
•
123k
•
18
RedHatAI/Qwen2-57B-A14B-Instruct-FP8
Text Generation
•
57B
•
Updated
•
1.28k
•
1
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
•
7B
•
Updated
•
1.04k
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
•
7B
•
Updated
•
483
•
2
RedHatAI/Qwen2-0.5B-Instruct-FP8
Text Generation
•
0.5B
•
Updated
•
1.56k
•
3
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
•
2B
•
Updated
•
9.11k
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
18.2k
•
2
RedHatAI/Qwen2-72B-Instruct-FP8
Text Generation
•
73B
•
Updated
•
1.74k
•
15
RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8
Text Generation
•
47B
•
Updated
•
63
•
3
RedHatAI/Meta-Llama-3-70B-Instruct-FP8
Text Generation
•
71B
•
Updated
•
1.78k
•
13
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
5.27k
•
23
RedHatAI/DeepSeek-Coder-V2-Lite-Base-FP8
Text Generation
•
16B
•
Updated
•
20
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
•
16B
•
Updated
•
8.39k
•
7
RedHatAI/Qwen2-7B-Instruct-quantized.w4a16
Text Generation
•
2B
•
Updated
•
350
RedHatAI/Qwen2-72B-Instruct-quantized.w4a16
Text Generation
•
12B
•
Updated
•
345
•
4
RedHatAI/Qwen2-1.5B-Instruct-quantized.w4a16
Text Generation
•
0.6B
•
Updated
•
342
RedHatAI/Qwen2-0.5B-Instruct-quantized.w4a16
Text Generation
•
0.3B
•
Updated
•
7
RedHatAI/Qwen2-72B-Instruct-quantized.w8a16
Text Generation
•
20B
•
Updated
•
68
•
1
RedHatAI/Qwen2-7B-Instruct-quantized.w8a16
Text Generation
•
3B
•
Updated
•
19
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a16
Text Generation
•
0.6B
•
Updated
•
8