|
|
--- |
|
|
license: apache-2.0 |
|
|
language: |
|
|
- en |
|
|
- zh |
|
|
base_model: |
|
|
- Tongyi-MAI/Z-Image-Turbo |
|
|
pipeline_tag: text-to-image |
|
|
tags: |
|
|
- text-to-image |
|
|
- image-generation |
|
|
- diffusion |
|
|
- comfyui |
|
|
- photorealistic |
|
|
- bilingual |
|
|
- chinese |
|
|
- english |
|
|
- 8-step |
|
|
- fast-generation |
|
|
--- |
|
|
|
|
|
# ๐ Z-Image-Turbo Collection |
|
|
|
|
|
**Ultra-Fast 8-Step Generation โข Photorealistic & Anime โข All-in-One** |
|
|
|
|
|
[](https://opensource.org/licenses/Apache-2.0) |
|
|
[](https://github.com/comfyanonymous/ComfyUI) |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ฆ Models |
|
|
|
|
|
| Model | Style | FP8 | BF16 | Best Sampler | |
|
|
|-------|-------|-----|------|--------------| |
|
|
| ๐ธ **Z-Image-Turbo-AIO** | Photorealistic | ~10GB | ~20GB | res_multistep + simple | |
|
|
| ๐จ **Z-Image-Turbo-Anime** | Anime | ~10GB | ~20GB | euler_ancestral + beta | |
|
|
|
|
|
**All models:** 8-9 steps โข CFG 1.0 โข 8GB VRAM compatible |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ง Workflows |
|
|
|
|
|
| Workflow | File | Best For | |
|
|
|----------|------|----------| |
|
|
| ๐ [Standard v1](#-standard-v10--v20) | `ZIT-AIO-v1.0.json` | Simple text-to-image | |
|
|
| ๐ [Standard v2](#-standard-v10--v20) | `ZIT-AIO-v2.0.json` | Improved upscaler | |
|
|
| ๐ฎ [ControlNet](#-controlnet) | `ZIT-AIO-Control.json` | Guided generation | |
|
|
| ๐ฒ [Seed Variance](#-seed-variance-enhancer) | `ZIT-AIO-Variance.json` | Output diversity | |
|
|
| ๐ฌ [SeedVR2](#-seedvr2) | `ZIT-AIO-SeedVR2.json` | Pro upscaling | |
|
|
| ๐ [Depth V3](#-depth-anything-v3) | `ZIT-AIO-DepthV3.json` | Depth control + 3D | |
|
|
| ๐จ [Anime](#-anime-workflow) | `ZIT-AIO-Anime.json` | Anime generation | |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ธ Z-Image-Turbo-AIO (Photorealistic) |
|
|
|
|
|
Alibaba Tongyi Lab's 6B parameter photorealistic generator with integrated VAE + Text Encoder. |
|
|
|
|
|
**Features:** โก 8-step โข ๐ Bilingual EN/CN โข ๐ธ Professional quality |
|
|
|
|
|
```yaml |
|
|
Steps: 9 | CFG: 1.0 | Sampler: res_multistep | Scheduler: simple |
|
|
``` |
|
|
|
|
|
๐ผ๏ธ [CivitAI Page](https://civitai.com/models/2173571?modelVersionId=2448013) |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐จ Z-Image-Turbo-Anime (NEW!) |
|
|
|
|
|
Custom anime checkpoint with merged LoRAs for enhanced anime aesthetics. |
|
|
|
|
|
**Features:** โก 8-step โข ๐จ Enhanced anime style โข ๐ Partially NSFW capable |
|
|
|
|
|
```yaml |
|
|
Steps: 8-9 | CFG: 1.0 | Sampler: euler_ancestral | Scheduler: beta |
|
|
``` |
|
|
|
|
|
๐ผ๏ธ [CivitAI Page](https://civitai.com/models/2259646/z-image-turbo-anime?modelVersionId=2543657) |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ Workflow Details |
|
|
|
|
|
<details> |
|
|
<summary><b>๐ Standard v1.0 & v2.0</b> โ Simple text-to-image</summary> |
|
|
|
|
|
### Standard Workflow |
|
|
|
|
|
 |
|
|
|
|
|
Simple text-to-image with upscaler and dual sampler options. |
|
|
|
|
|
**v2.0 Improvements:** |
|
|
- Enhanced upscaler with denoise control (0.4-0.6) |
|
|
- Dual sampler: `res_multistep` or `euler_ancestral` |
|
|
- Dual scheduler: `simple` or `beta` |
|
|
- `scale_by` parameter for output size |
|
|
|
|
|
**Required Nodes:** rgthree-comfy, comfyui_image_metadata_extension |
|
|
|
|
|
 |
|
|
|
|
|
๐ฅ **Download:** [ZIT-AIO-v1.0.json](workflows/ZIT-AIO-v1.0.json) | [ZIT-AIO-v2.0.json](workflows/ZIT-AIO-v2.0.json) |
|
|
|
|
|
</details> |
|
|
|
|
|
<details> |
|
|
<summary><b>๐ฎ ControlNet</b> โ Guided generation with reference images</summary> |
|
|
|
|
|
### ControlNet Union Workflow |
|
|
|
|
|
 |
|
|
|
|
|
5 control types for precise composition control. |
|
|
|
|
|
**Control Types:** |
|
|
- Canny โ Edge detection |
|
|
- HED โ Soft edges |
|
|
- Depth โ 3D depth |
|
|
- Pose โ Body pose |
|
|
- MLSD โ Line detection |
|
|
|
|
|
**Settings:** |
|
|
- ControlNet strength: 0.6-0.8 |
|
|
- Megapixel scaling (auto aspect ratio) |
|
|
|
|
|
**Required:** |
|
|
- ControlNet Union file โ `ComfyUI/models/model_patches/` |
|
|
- Download: [ControlNet Union](https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union) |
|
|
|
|
|
**Required Nodes:** rgthree-comfy, comfyui_image_metadata_extension, comfyui_controlnet_aux |
|
|
|
|
|
 |
|
|
|
|
|
๐ฅ **Download:** [ZIT-AIO-Control.json](workflows/ZIT-AIO-Control.json) |
|
|
|
|
|
</details> |
|
|
|
|
|
<details> |
|
|
<summary><b>๐ฒ Seed Variance Enhancer</b> โ More diversity from same prompt</summary> |
|
|
|
|
|
### Seed Variance Enhancer Workflow |
|
|
|
|
|
 |
|
|
|
|
|
Adds controlled noise to text embeddings for more varied outputs. |
|
|
|
|
|
**Settings:** |
|
|
```yaml |
|
|
randomize_percent: 50 |
|
|
strength: 20-30 |
|
|
noise_insert: 'noise on beginning steps' |
|
|
steps_switchover_percent: 20 |
|
|
``` |
|
|
|
|
|
**Trade-off:** Higher values = more diversity but less prompt adherence |
|
|
|
|
|
**Required Nodes:** rgthree-comfy, comfyui_image_metadata_extension, [SeedVarianceEnhancer](https://github.com/ChangeTheConstants/SeedVarianceEnhancer) |
|
|
|
|
|
 |
|
|
|
|
|
๐ฅ **Download:** [ZIT-AIO-Variance.json](workflows/ZIT-AIO-Variance.json) |
|
|
|
|
|
</details> |
|
|
|
|
|
<details> |
|
|
<summary><b>๐ฌ SeedVR2</b> โ Professional diffusion upscaling</summary> |
|
|
|
|
|
### SeedVR2 Video Upscaler Workflow |
|
|
|
|
|
 |
|
|
|
|
|
DiT-based upscaling with superior quality and temporal consistency. |
|
|
|
|
|
**Settings:** |
|
|
```yaml |
|
|
resolution: 1536 (target short edge) |
|
|
batch_size: 4n+1 (1, 5, 9, 13, 17...) |
|
|
color_correction: 'lab' |
|
|
``` |
|
|
|
|
|
**Models:** 3B (faster) or 7B (higher quality) โข FP16/FP8/GGUF options |
|
|
|
|
|
**Note:** Models auto-download to `ComfyUI/models/SEEDVR2/` |
|
|
|
|
|
**Required Nodes:** rgthree-comfy, comfyui_image_metadata_extension, [SeedVR2](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler) |
|
|
|
|
|
 |
|
|
|
|
|
๐ฅ **Download:** [ZIT-AIO-SeedVR2.json](workflows/ZIT-AIO-SeedVR2.json) |
|
|
|
|
|
</details> |
|
|
|
|
|
<details> |
|
|
<summary><b>๐ Depth Anything V3</b> โ Depth control + 3D preview</summary> |
|
|
|
|
|
### Depth Anything V3 ControlNet Workflow |
|
|
|
|
|
 |
|
|
|
|
|
State-of-the-art depth-guided generation with dual modes. |
|
|
|
|
|
**Two Modes:** |
|
|
1. **Generate** โ Depth-controlled image generation |
|
|
2. **3D Preview** โ View depth as 3D point cloud |
|
|
|
|
|
**Models:** da3_small / da3_base / da3_large / da3_giant |
|
|
- 8GB VRAM: Use up to `da3_large` |
|
|
|
|
|
**Note:** Models auto-download on first use |
|
|
|
|
|
**Required Nodes:** rgthree-comfy, comfyui_image_metadata_extension, [Depth Anything 3](https://github.com/ByteDance-Seed/Depth-Anything-3) |
|
|
|
|
|
 |
|
|
|
|
|
๐ฅ **Download:** [ZIT-AIO-DepthV3.json](workflows/ZIT-AIO-DepthV3.json) |
|
|
|
|
|
</details> |
|
|
|
|
|
<details> |
|
|
<summary><b>๐จ Anime Workflow</b> โ For Z-Image-Turbo-Anime</summary> |
|
|
|
|
|
### Z-Image-Turbo-Anime Workflow |
|
|
|
|
|
 |
|
|
|
|
|
Optimized workflow for the anime checkpoint with integrated features. |
|
|
|
|
|
**Includes:** |
|
|
- Seed Variance Enhancer for diversity |
|
|
- SeedVR2 as alternative upscaler |
|
|
- Toggle switches for optional features |
|
|
- Image comparison nodes |
|
|
|
|
|
**Best Settings:** |
|
|
```yaml |
|
|
Sampler: euler_ancestral |
|
|
Scheduler: beta |
|
|
Resolution: 832ร1216 (portrait) / 1216ร832 (landscape) |
|
|
``` |
|
|
|
|
|
**Required Nodes:** rgthree-comfy, comfyui_image_metadata_extension, SeedVarianceEnhancer, SeedVR2 |
|
|
|
|
|
 |
|
|
|
|
|
๐ฅ **Download:** [ZIT-AIO-Anime.json](workflows/ZIT-AIO-Anime.json) |
|
|
|
|
|
</details> |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ FP8 vs BF16 |
|
|
|
|
|
| | ๐ก FP8 (~10GB) | ๐ BF16 (~20GB) | |
|
|
|---|----------------|-----------------| |
|
|
| **Quality** | 95% | 100% | |
|
|
| **Speed** | Slightly faster | Standard | |
|
|
| **Download** | Faster | Slower | |
|
|
| **Best For** | Daily use | Final renders | |
|
|
|
|
|
**Both work on 8GB VRAM!** |
|
|
|
|
|
--- |
|
|
|
|
|
## โ๏ธ Common Settings |
|
|
|
|
|
```yaml |
|
|
# Essential (ALL workflows) |
|
|
Steps: 8-9 |
|
|
CFG: 1.0 # Don't change! |
|
|
Negative Prompt: NONE (use ConditioningZeroOut) |
|
|
|
|
|
# Photorealistic |
|
|
Sampler: res_multistep |
|
|
Scheduler: simple |
|
|
|
|
|
# Anime |
|
|
Sampler: euler_ancestral |
|
|
Scheduler: beta |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ก Prompting |
|
|
|
|
|
**Natural language works best!** (100-300 words) |
|
|
|
|
|
``` |
|
|
โ
Professional food photography of artisan breakfast plate. |
|
|
Golden poached eggs on sourdough toast, crispy bacon, fresh |
|
|
avocado slices. Morning sunlight creating warm glow. |
|
|
|
|
|
โ breakfast, eggs, toast, bacon, food photo, delicious |
|
|
``` |
|
|
|
|
|
**Bilingual Text Rendering:** |
|
|
``` |
|
|
English: Neon sign reading "COFFEE SHOP" in bright blue letters |
|
|
Chinese: Sign with "ๅๅกๅบ" in elegant gold calligraphy |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ฅ Downloads |
|
|
|
|
|
### Models |
|
|
|
|
|
| Source | Link | |
|
|
|--------|------| |
|
|
| ๐ผ๏ธ CivitAI (Photorealistic) | [civitai.com/models/2173571](https://civitai.com/models/2173571?modelVersionId=2448013) | |
|
|
| ๐ผ๏ธ CivitAI (Anime) | [civitai.com/models/2259646](https://civitai.com/models/2259646/z-image-turbo-anime?modelVersionId=2543657) | |
|
|
| ๐ค HuggingFace | This page | |
|
|
| ๐ฎ TensorArt | [tensorart.me](https://tensorart.me/de-DE/models/939302608026244936/Z-Image-Turbo-AIO-FP8-FP8) | |
|
|
|
|
|
### Additional Files |
|
|
|
|
|
| File | Link | |
|
|
|------|------| |
|
|
| ControlNet Union | [HuggingFace](https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union) | |
|
|
| SeedVR2 Models | Auto-downloads on first use | |
|
|
| Depth Anything V3 | Auto-downloads on first use | |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ฆ Custom Nodes |
|
|
|
|
|
<details> |
|
|
<summary><b>All Required Nodes</b></summary> |
|
|
|
|
|
**Base (all workflows):** |
|
|
- [rgthree-comfy](https://github.com/rgthree/rgthree-comfy) |
|
|
- [comfyui_image_metadata_extension](https://github.com/edelvarden/comfyui_image_metadata_extension) |
|
|
|
|
|
**ControlNet Workflow:** |
|
|
- [comfyui_controlnet_aux](https://github.com/Fannovel16/comfyui_controlnet_aux) |
|
|
|
|
|
**Seed Variance / Anime Workflow:** |
|
|
- [SeedVarianceEnhancer](https://github.com/ChangeTheConstants/SeedVarianceEnhancer) |
|
|
|
|
|
**SeedVR2 / Anime Workflow:** |
|
|
- [SeedVR2](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler) |
|
|
|
|
|
**Depth V3 Workflow:** |
|
|
- [Depth Anything 3](https://github.com/ByteDance-Seed/Depth-Anything-3) |
|
|
|
|
|
</details> |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ Files |
|
|
|
|
|
``` |
|
|
โโโ z-image-turbo-fp8-aio.safetensors # ๐ธ Photorealistic FP8 |
|
|
โโโ z-image-turbo-bf16-aio.safetensors # ๐ธ Photorealistic BF16 |
|
|
โโโ z-image-anime-aio-fp8.safetensors # ๐จ Anime FP8 |
|
|
โโโ z-image-anime-aio-bf16.safetensors # ๐จ Anime BF16 |
|
|
โ |
|
|
โโโ workflows/ |
|
|
โ โโโ ZIT-AIO-v1.0.json # ๐ Standard v1 |
|
|
โ โโโ ZIT-AIO-v2.0.json # ๐ Standard v2 |
|
|
โ โโโ ZIT-AIO-Control.json # ๐ฎ ControlNet |
|
|
โ โโโ ZIT-AIO-Variance.json # ๐ฒ Seed Variance |
|
|
โ โโโ ZIT-AIO-SeedVR2.json # ๐ฌ SeedVR2 |
|
|
โ โโโ ZIT-AIO-DepthV3.json # ๐ Depth V3 |
|
|
โ โโโ ZIT-AIO-Anime.json # ๐จ Anime |
|
|
โ |
|
|
โโโ images/ |
|
|
โ โโโ test1-coffee-shop.jpeg # Photo examples |
|
|
โ โโโ test2-chinese-temple.jpeg |
|
|
โ โโโ test3-sushi.jpeg |
|
|
โ โโโ workflows/ # Workflow images |
|
|
โ โโโ v1-title.png & v1-screenshot.png |
|
|
โ โโโ v2-title.png & v2-screenshot.png |
|
|
โ โโโ control-title.png & control-screenshot.png |
|
|
โ โโโ variance-title.png & variance-screenshot.png |
|
|
โ โโโ seedvr2-title.png & seedvr2-screenshot.png |
|
|
โ โโโ depthv3-title.png & depthv3-screenshot.png |
|
|
โ โโโ anime-title.png & anime-screenshot.png |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## ๐ Credits |
|
|
|
|
|
**Original Model:** Tongyi Lab (Alibaba) โข 6B S3-DiT โข Apache 2.0 |
|
|
|
|
|
**AIO Conversion & Anime Merge:** [SeeSee21](https://huggingface.co/SeeSee21) |
|
|
|
|
|
**Resources:** |
|
|
[Original HuggingFace](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) โข |
|
|
[GitHub](https://github.com/Tongyi-MAI/Z-Image) โข |
|
|
[ComfyUI Files](https://huggingface.co/Comfy-Org/z_image_turbo) |
|
|
|
|
|
--- |
|
|
|
|
|
**Download โ Load Checkpoint โ Generate!** ๐ |
|
|
--- |
|
|
|
|
|
<div align="center"> |
|
|
|
|
|
**Download, load with "Load Checkpoint", and generate professional photos in seconds! ๐** |
|
|
|
|
|
</div> |