metadata
license: other
license_name: stabilityai-ai-community
license_link: LICENSE.md
tags:
- text-to-image
- stable-diffusion
- diffusers
inference: true
language:
- en
pipeline_tag: text-to-image
Stable Diffusion 3.5 Medium ONNX
This ONNX version of Stable Diffusion 3.5 Medium was made from the PyTorch source model, using optimum-cli
: Converting Stable Diffusion 3.5 Medium From PyTorch to ONNX.
Usage
Python Gradio: Stable Diffusion 3.5 Inpainting in ONNX
Model
Stable Diffusion 3.5 Medium is a Multimodal Diffusion Transformer with improvements (MMDiT-X) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
Please note: This model is released under the Stability Community License. Visit Stability AI to learn or contact us for commercial licensing details.
Model Description
- Developed by: Stability AI
- Model type: MMDiT-X text-to-image generative model
- Model Description: This model generates images based on text prompts. It is a Multimodal Diffusion Transformer (https://arxiv.org/abs/2403.03206) with improvements that use three fixed, pretrained text encoders, with QK-normalization to improve training stability, and dual attention blocks in the first 12 transformer layers.
License
- Community License: Free for research, non-commercial, and commercial use for organizations or individuals with less than $1M in total annual revenue. More details can be found in the Community License Agreement. Read more at https://stability.ai/license.
- For individuals and organizations with annual revenue above $1M: please contact us to get an Enterprise License.