Z-Image Turbo FP8 [Kijai]
About this model
Text Encoder | VAE
Early support for LoRA training (Turbo) | Prompt Guide
If you're already using SDXL, Flux, or Qwen-based generators, here's the simple version of what Z-Image-Turbo is:
It's a super-fast text-to-image model from Alibaba that spits out 1024×1024 pictures in less than a second on a single high-end card (or a couple of seconds on a 3090/4090). Speed comes from heavy distillation , it’s basically a 6B model taught by much bigger internal beasts, so it feels closer to closed top-tier models than most open-source stuff.
Compared to what you know:
Faster than Flux and SDXL (the SD3.5 Turbo is a bit faster in my testing.)
Better prompt following and prettier results than most current open models
Renders English and Chinese text in images almost perfectly
Currently sitting at #1 on the public human-voted leaderboard (AI Arena Elo)
Tongyi-MAI HF | GitHub
The model is mirrored here for convenience.
Tags
Related Models
Similar AI models you may like
Juggernaut XL
Pony Diffusion V6 XL
CyberRealistic Pony
CyberRealistic
epiCRealism XL
Nova Anime XL
Realism By Stable Yogi (Pony)