This repository provides FP8_E5M2 and FP8_E4M3FN quantizations of Comfy-Org/z_image_turbo.
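For readers unfamiliar with the two formats named above, the following is a minimal sketch of their numeric ranges, derived only from the bit layouts (E4M3FN: 4 exponent bits, 3 mantissa bits, bias 7, no infinities; E5M2: 5 exponent bits, 2 mantissa bits, bias 15, IEEE-style infinities and NaNs). The `Fp8Format` class and its field names are hypothetical helpers written for this illustration; they are not part of this repository or of any library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fp8Format:
    """Illustrative description of an 8-bit float layout (hypothetical helper)."""
    name: str
    exp_bits: int
    man_bits: int
    bias: int
    ieee_specials: bool  # True: the all-ones exponent is reserved for inf/NaN

    @property
    def max_finite(self) -> float:
        top_exp = (1 << self.exp_bits) - 1
        if self.ieee_specials:
            # E5M2: the top exponent code is reserved, so the largest finite
            # value uses exponent top_exp - 1 with an all-ones mantissa.
            exp, man = top_exp - 1, (1 << self.man_bits) - 1
        else:
            # E4M3FN: only (all-ones exponent, all-ones mantissa) encodes NaN,
            # so the largest finite value sits one mantissa step below it.
            exp, man = top_exp, (1 << self.man_bits) - 2
        return 2.0 ** (exp - self.bias) * (1 + man / (1 << self.man_bits))

    @property
    def min_subnormal(self) -> float:
        # Smallest positive value: exponent field 0, mantissa field 1.
        return 2.0 ** (1 - self.bias - self.man_bits)


E4M3FN = Fp8Format("fp8_e4m3fn", exp_bits=4, man_bits=3, bias=7, ieee_specials=False)
E5M2 = Fp8Format("fp8_e5m2", exp_bits=5, man_bits=2, bias=15, ieee_specials=True)

for fmt in (E4M3FN, E5M2):
    print(f"{fmt.name}: max={fmt.max_finite}, min_subnormal={fmt.min_subnormal}")
```

The trade-off this shows: E4M3FN spends one more bit on the mantissa (finer steps, maximum of 448), while E5M2 spends it on the exponent (coarser steps, maximum of 57344).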

Sample comparison (images omitted): bf16 vs. fp8_e4m3fn, Image 1 and Image 2.

⚡️ Z-Image
An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer


Welcome to the official repository for the Z-Image (造相) project!

✨ Z-Image

Z-Image is a powerful and highly efficient image generation model with 6B parameters. It currently has three variants:

  • 🚀 Z-Image-Turbo – A distilled version of Z-Image that matches or exceeds leading competitors with only 8 NFEs (Number of Function Evaluations). It offers ⚡️sub-second inference latency⚡️ on enterprise-grade H800 GPUs and fits comfortably on consumer devices with 16 GB of VRAM. It excels in photorealistic image generation, bilingual text rendering (English & Chinese), and robust instruction adherence.

  • 🧱 Z-Image-Base – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.

  • ✍️ Z-Image-Edit – A variant fine-tuned on Z-Image specifically for image editing tasks. It supports creative image-to-image generation with impressive instruction-following capabilities, allowing for precise edits based on natural language prompts.
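To make the 6B-parameter and 16 GB figures above concrete, here is a rough, weights-only memory estimate at each precision. It is a back-of-the-envelope sketch under stated assumptions: "6B" is treated as exactly 6e9 parameters, bf16 takes 2 bytes per parameter and FP8 takes 1, and the text encoder, VAE, activations, and framework overhead are all ignored. The `weight_gib` function is a hypothetical helper, not part of any library.

```python
# Bytes per parameter for each storage format.
BYTES_PER_PARAM = {"bf16": 2, "fp8_e4m3fn": 1, "fp8_e5m2": 1}

NUM_PARAMS = 6e9  # assumption: "6B" taken as exactly 6e9 parameters


def weight_gib(num_params: float, dtype: str) -> float:
    """Weights-only footprint in GiB (2**30 bytes); ignores all other memory."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_gib(NUM_PARAMS, dtype):.1f} GiB")
```

Under these assumptions the diffusion weights take about 11.2 GiB in bf16 and about 5.6 GiB in either FP8 format, which is the main practical motivation for an FP8 checkpoint on memory-constrained GPUs.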

📥 Model Zoo

Model | Hugging Face | ModelScope
Z-Image-Turbo | Hugging Face, Hugging Face Space | ModelScope Model, ModelScope Space
Z-Image-Base | To be released | To be released
Z-Image-Edit | To be released | To be released