AI & ML interests
LLMs, optimization, compression, sparsification, quantization, pruning, distillation, NLP, CV
- RedHatAI/DeepSeek-R1-Distill-Llama-8B-FP8-dynamic
  Text Generation • 8B • Updated • 1.54k • 4
- RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8
  Text Generation • 71B • Updated • 3.24k • 2
- RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w4a16
  Text Generation • 2B • Updated • 1.84k
- RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8
  Text Generation • 8B • Updated • 1.32k • 2

- RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
  Text Generation • 3B • Updated • 1.91k
- RedHatAI/granite-3.1-8b-instruct-FP8-dynamic
  Text Generation • 8B • Updated • 902 • 1
- RedHatAI/granite-3.1-2b-instruct-quantized.w4a16
  Text Generation • 0.5B • Updated • 63
- RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
  Text Generation • 1B • Updated • 1.29k • 1