Bonsai Image: Compact AI Image Generation

State-of-the-art image generation, in your browser. Bonsai Image 4B is a compressed text-to-image model from PrismML, built for local generation on iPhone, Mac, and GPUs.

Bonsai Image is a family of low-bit image generation models designed to make high-quality AI image creation run closer to the user.

Instead of relying only on large cloud models, it compresses a 4B-class diffusion transformer so it can fit into everyday hardware with much lower memory pressure.

Model Overview

Item Details
Model name Bonsai Image 4B
Developer PrismML
Model type Text-to-image diffusion model
Base model FLUX.2 Klein 4B
Architecture MMDiT diffusion transformer
Main variants 1-bit Bonsai Image 4B, Ternary Bonsai Image 4B
Parameters About 4B transformer trunk
Native resolution 1024×1024, also supports 512×512 and other multiples of 32
Transformer size 0.93 GB for 1-bit, 1.21 GB for Ternary
Apple Silicon payload 3.42 GB for 1-bit, 3.88 GB for Ternary
License Apache 2.0
Supported deployment Apple Silicon, iPhone, iPad, Mac, CUDA GPUs
Main use case Local and private AI image generation

Features

Local Image Generation

Bonsai Image is built for on-device and local inference. It can run on Apple Silicon devices and compatible GPUs without sending every prompt to a cloud service.

Small Model Footprint

The model compresses the diffusion transformer from FLUX.2 Klein 4B into much smaller 1-bit and ternary versions. This lowers memory usage while keeping the model useful for real image generation.

Two Model Variants

The 1-bit version is the smaller, footprint-focused option. The Ternary version is larger but aims for better image quality and prompt fidelity.

Strong Quality-to-Size Tradeoff

Ternary Bonsai Image 4B is designed to stay close to FLUX.2 Klein 4B on benchmarks while using a much smaller transformer. It is not meant to beat the largest image models, but to make capable generation practical on smaller hardware.

Privacy-Friendly Workflow

Because generation can happen locally, prompts and generated images do not need to leave the device. This is useful for personal creative tools, enterprise workflows, and products with data residency concerns.

How to use

Enter a image description

how to use

set image paramter and click generater button

how to use

FAQ

Is Bonsai Image open source?

Bonsai Image 4B is released with open weights and code under the Apache 2.0 license.

What is Bonsai Image used for?

It is used for generating images from text prompts, especially in local apps, private creative tools, browser demos, and device-side image generation workflows.

What is the difference between 1-bit and Ternary Bonsai Image?

1-bit Bonsai Image is the smaller version, with a diffusion transformer under 1 GB. Ternary Bonsai Image is the quality-focused version, using slightly more space to improve visual quality and prompt following.

Can Bonsai Image run on an iPhone?

Yes. PrismML positions Bonsai Image 4B for Apple Silicon devices, including iPhone, iPad, and Mac. The official launch also mentions Bonsai Studio as an iOS app for trying the model.

Is Bonsai Image better than Stable Diffusion?

It depends on what you need. Bonsai Image is mainly interesting because it offers a strong quality-to-size tradeoff for local generation, not because it replaces every larger or specialized image model.

What are the limitations of Bonsai Image?

Image quality still depends on prompts and workflow. Fine text, exact object counts, tiny details, and strict composition may need testing before using it in production.