File size: 1,043 Bytes
93cf987 8198729 c3e1b9c 8198729 93cf987 8198729 93cf987 8198729 93cf987 8198729 93cf987 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 |
---
language:
- en
base_model:
- openai/gpt-oss-120b
- unsloth/gpt-oss-120b-BF16
pipeline_tag: text-generation
tags:
- gpt_oss
- vllm
- fp8
- conversational
- compressed-tensors
- text-generation-inference
license: apache-2.0
license_name: apache-2.0
name: RedHatAI/gpt-oss-120b-FP8-Dynamic
description: This model was obtained by quantizing the weights and activations to FP8 data type. It is based on the gpt-oss-120b-BF16 architecture.
readme: https://huggingface.co/RedHatAI/gpt-oss-120b-FP8-Dynamic/main/README.md
tasks:
- text-to-text
provider: RedHatAI
license_link: https://www.apache.org/licenses/LICENSE-2.0
---
# RedHatAI/gpt-oss-120b-FP8-Dynamic
## Model Overview
- **Model Architecture:** gpt-oss-120b-BF16
- **Input:** Text
- **Output:** Text
- **Model Optimizations:**
- **Weight quantization:** FP8
- **Activation quantization:** FP8
- **Release Date:** 08/13/2025
- **Version:** 1.0
- **Model Developers:** RedHatAI
Quantized version of [unsloth/gpt-oss-120b-BF16](https://huggingface.co/unsloth/gpt-oss-120b-BF16). |