x/z-image-turbo

174.8K downloads · Updated 6 months ago

Z-Image is a powerful and highly efficient image generation model.

image

ollama run x/z-image-turbo

Models

3 models

| Name | Size | Context | Input | Updated |
|------|------|---------|-------|---------|
| `z-image-turbo:latest` | 13GB | - | Text | 6 months ago |
| `z-image-turbo:fp8` (latest) | 13GB | - | Text | 6 months ago |
| `z-image-turbo:bf16` | 33GB | - | Text | 6 months ago |

> Note: Image generation models currently only work on macOS.

Z-Image Turbo is a 6 billion parameter text-to-image model from Alibaba's Tongyi Lab. It generates high-quality photorealistic images.

## Features

- **Photorealistic output**: Strong at generating realistic photographs, portraits, and scenes
- **Bilingual text rendering**: Accurately renders both English and Chinese text in images
- **Apache 2.0**: Open weights available for commercial use

## Usage

```
ollama run x/z-image-turbo "a cat holding a sign that says hello world"
```

| Variant | Size | Description |
|---------|------|-------------|
| `fp8` (default) | 13GB | FP8 quantized - best balance of quality and size |
| `bf16` | 33GB | BF16 full precision - highest quality |

![Ollama screenshot 2026-01-20 at 22.22.12@2x.png](/assets/x/z-image-turbo/244477ef-d81a-418c-af9a-d05c4c36f405)
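
To request a specific variant rather than the default, the tag from the table can be appended to the model name. The following is a minimal sketch, assuming a POSIX shell. The `z_image_model` function is a hypothetical helper, not part of Ollama; it only builds the model reference and rejects variants that the table does not list.

```shell
#!/bin/sh
# Hypothetical helper (not part of Ollama): map a variant name from the
# table above to a full model reference. Defaults to fp8, the default
# variant; any other name is rejected.
z_image_model() {
  case "${1:-fp8}" in
    fp8|bf16) printf 'x/z-image-turbo:%s\n' "${1:-fp8}" ;;
    *) echo "unknown variant: $1 (expected fp8 or bf16)" >&2; return 1 ;;
  esac
}

# Print the model reference for the full-precision variant.
z_image_model bf16

# To generate an image with that variant (requires Ollama on macOS; the
# table lists bf16 at 33GB), uncomment the next line:
# ollama run "$(z_image_model bf16)" "a cat holding a sign that says hello world"
```

Keeping the tag lookup separate from the `ollama run` call makes it easy to check the model reference before starting a large download.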

## Examples

### Simple Prompts

**Portrait**
```
A chef in a busy kitchen, steam rising from pots
```
![Ollama screenshot 2026-01-19 at 16.36.42@2x.png](/assets/x/scratchpad/bc4e8ce3-b783-42ed-9e3f-73419dcfc5ee)

**Landscape**
```
Mountain lake at sunrise, pine trees, morning mist
```

![Ollama screenshot 2026-01-19 at 16.36.38@2x.png](/assets/x/scratchpad/a20edb3d-95d5-4889-ab12-d445c1aac0ff)

**Product photo**
```
White sneakers on concrete, overhead shot
```
![Ollama screenshot 2026-01-19 at 16.37.01@2x.png](/assets/x/scratchpad/893f84e7-d5e2-4e28-aeba-6636132be8e2)

**Text rendering**
```
A storefront sign that says "BAKERY" in gold letters
```
![Ollama screenshot 2026-01-19 at 16.38.19@2x.png](/assets/x/scratchpad/d7ff9d7f-2479-4f5f-a3b9-c8d08757e327)
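
Several of the simple prompts above can be run in one pass with a small loop. This is a sketch under stated assumptions: `DRY_RUN`, `generate`, and `generate_all` are hypothetical names introduced for this script, and by default it only prints the `ollama run` commands instead of executing them.

```shell
#!/bin/sh
# Sketch: issue one `ollama run` call per prompt.
# DRY_RUN is a hypothetical switch for this script only: with the default
# value 1 the commands are printed; set DRY_RUN=0 to invoke Ollama.
MODEL="x/z-image-turbo"
DRY_RUN="${DRY_RUN:-1}"

generate() {
  if [ "$DRY_RUN" = "1" ]; then
    printf 'ollama run %s "%s"\n' "$MODEL" "$1"
  else
    ollama run "$MODEL" "$1"
  fi
}

generate_all() {
  # Three of the simple prompts from the examples above.
  for prompt in \
    "A chef in a busy kitchen, steam rising from pots" \
    "Mountain lake at sunrise, pine trees, morning mist" \
    "White sneakers on concrete, overhead shot"
  do
    generate "$prompt"
  done
}

generate_all
```

Run it once as-is to review the commands, then again with `DRY_RUN=0` to generate the images.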

### Detailed Prompts

**Photorealistic portrait**
```
Young woman in a cozy coffee shop, natural window lighting, wearing a cream knit sweater, holding a ceramic mug, soft bokeh background with warm ambient lights, candid moment, shot on 35mm film
```
![Ollama screenshot 2026-01-19 at 16.39.27@2x.png](/assets/x/scratchpad/334440a7-a9fe-4440-9044-c8a51853c3ef)

**Chinese text rendering**
```
Traditional Chinese calligraphy brush painting style, the characters "山高水长" written in elegant black ink on rice paper, red seal stamp in corner, minimalist composition
```
![Ollama screenshot 2026-01-19 at 16.40.57@2x.png](/assets/x/scratchpad/40bf60ba-ed92-4848-9a31-94143a18d72f)

**Creative composition**
```
Surreal double exposure portrait, woman's silhouette filled with blooming cherry blossom trees, soft pink and white petals floating, dreamy ethereal atmosphere, fine art photography
```
![Ollama screenshot 2026-01-19 at 16.42.02@2x.png](/assets/x/scratchpad/faef220c-ab48-40f7-b28c-7108c863cf2f)

## Best Practices

- Use detailed, descriptive prompts (the model excels with rich descriptions)
- For text in images, explicitly specify the text content in quotes and describe style/position
- 1024x1024 resolution is recommended
- The model works well for photorealistic styles out of the box
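
The text-rendering advice above can be applied mechanically when building prompts in a script. The sketch below assumes a POSIX shell; `text_prompt` is a hypothetical helper that wraps the text to render in double quotes and appends a style or position description.

```shell
#!/bin/sh
# Hypothetical helper: build a prompt that places the text to render in
# double quotes and appends a style/position description.
# usage: text_prompt SUBJECT TEXT STYLE
text_prompt() {
  printf '%s that says "%s" %s\n' "$1" "$2" "$3"
}

text_prompt "A storefront sign" "BAKERY" "in gold letters"
# prints: A storefront sign that says "BAKERY" in gold letters

# To pass the result to the model, uncomment the next line:
# ollama run x/z-image-turbo "$(text_prompt "A storefront sign" "BAKERY" "in gold letters")"
```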

## Limitations

- Complex scenes may occasionally have coherence issues
- The model is not intended to provide factual information
- Output quality is influenced by prompting style

## License

This model uses weights from Z-Image-Turbo by Tongyi-MAI (Alibaba), released under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
