Image Generation Node - Builder Studio Docs

The Image Generation node creates a new image from a text prompt and an optional reference image. It is provider-bound: each node carries a provider preset that routes the request to one of the two image backends, and the generated result is stored on the canvas as a synced media asset.

Generation is separate from the Image node

Use Image Generation to synthesize a new image. Use the Image node to store, crop, export, or reuse an image that already exists on the canvas. The generated output is written as a media asset that downstream nodes consume like any other image.

What it does

Resolves a prompt from the node field or a connected text input.
Resolves the provider and model from the node's preset, then posts a generation request to the matching API route.
Resolves a single connected reference image when the selected model supports image input.
Decodes the returned image, stores it durably, and emits it on the image-out port.

Providers

Image generation is wired to two providers. The node preset selects which one a given node uses.

| Provider | Route | Models |
| --- | --- | --- |
| Google | `/api/generate-image` | Gemini image generation through the Google GenAI SDK. The default model is configured server-side; the node may also send an explicit model id. |
| fal | `/api/fal/generate-image` | FLUX.2 and other fal endpoints. The default endpoint is `fal-ai/flux-2` when none is provided. |
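
The provider-to-route mapping in the table can be sketched as a small lookup. This is a minimal illustration, not the product's implementation: the routes and the fal default endpoint come from the table, while the function name `resolve_route` and the lowercase provider keys are assumptions made for the example.

```python
# Routes and the fal default are taken from the Providers table.
# resolve_route and the lowercase provider keys are hypothetical.
ROUTES = {
    "google": "/api/generate-image",
    "fal": "/api/fal/generate-image",
}
DEFAULT_FAL_ENDPOINT = "fal-ai/flux-2"


def resolve_route(provider, model=None):
    """Return (route, model) for a node's provider preset."""
    key = provider.strip().lower()
    if key not in ROUTES:
        # Only two providers are active image backends.
        raise ValueError(f"unsupported image provider: {provider!r}")
    if key == "fal" and not model:
        model = DEFAULT_FAL_ENDPOINT
    # For Google, a missing model stays None so the server-side default applies.
    return ROUTES[key], model
```

Calling `resolve_route("fal")` yields the fal route with the default endpoint, while any provider other than the two listed raises an error.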

Two providers, not five

Image and video generation resolve only the google and fal providers in the current build. Other vendor logos that appear elsewhere in the product are not active image-generation backends today.

fal models

These fal image endpoints have first-class request contracts. Endpoints not in the contract list are still accepted and sent to fal as-is, using an image_size sizing mode and an image_url reference field by default.

| Model | Endpoint id | Notes |
| --- | --- | --- |
| FLUX.2 Dev | `fal-ai/flux-2` | Default fal image endpoint. High quality, configurable. |
| FLUX.2 Flash | `fal-ai/flux-2/flash` | Fast text-to-image. |
| FLUX.2 Pro | `fal-ai/flux-2-pro` | Balanced quality and speed. |
| Seedream 4.5 | `fal-ai/bytedance/seedream/v4.5/text-to-image` | High-quality unified image generation. |
| Recraft V4.1 | `fal-ai/recraft/v4.1/text-to-image` | Design-focused generation with color controls. |
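
The fallback for endpoints outside the contract list can be pictured as a dictionary lookup with a default. The sketch below is illustrative only: `EndpointContract`, `contract_for`, and the `example/text-only` entry are hypothetical names, and this page does not list the real per-endpoint contract values. Only the fallback itself (an image_size sizing mode and an image_url reference field) comes from the text above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EndpointContract:
    sizing_mode: str                # "image_size" or "aspect_ratio"
    reference_field: Optional[str]  # None means no reference image input


# Fallback described in the docs for endpoints without a first-class contract.
FALLBACK = EndpointContract(sizing_mode="image_size", reference_field="image_url")

# Hypothetical contract table; the entry is a placeholder, not a real endpoint.
CONTRACTS = {
    "example/text-only": EndpointContract("aspect_ratio", None),
}


def contract_for(endpoint_id, contracts=CONTRACTS):
    """Look up an endpoint's contract, falling back to the documented default."""
    return contracts.get(endpoint_id, FALLBACK)
```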

Inputs

Prompt — required. Read from the node prompt field or a connected text source. Prompts must be under the shared maximum prompt length.
Reference image — optional. The first connected visual input is resolved and passed to the model when the model supports image input. fal endpoints without a reference image field reject reference input with a 400.
Aspect ratio and image size — node fields, shaped per provider. fal uses either an image_size preset or an aspect_ratio string depending on the endpoint contract.
Model parameters — endpoint-specific extras such as guidance_scale, num_inference_steps, and seed. Unsupported keys for the chosen endpoint are filtered out before the request is sent.
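
The parameter filtering and the reference-image rejection described above might look roughly like this. It is a sketch under stated assumptions: the helper names, the `BadRequest` class, and the allowed-key set passed in by the caller are all hypothetical, and the real per-endpoint allow-lists are not given on this page.

```python
class BadRequest(Exception):
    """Hypothetical error type standing in for an HTTP 400 response."""
    status = 400


def filter_model_params(params, allowed):
    """Drop keys the chosen endpoint does not support."""
    return {k: v for k, v in params.items() if k in allowed}


def attach_reference(body, reference_url, reference_field):
    """Add the reference image under the endpoint's reference field.

    reference_field is None for endpoints without image input, in which
    case a supplied reference is rejected rather than silently ignored.
    """
    if reference_url is None:
        return body
    if reference_field is None:
        raise BadRequest("endpoint does not accept a reference image")
    return {**body, reference_field: reference_url}
```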

fal image size presets

The fal route accepts these imageSize presets: square_hd, square, portrait_4_3, portrait_16_9, landscape_4_3, and landscape_16_9. Aspect ratio and image size are top-level request fields, not model parameters.
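
As a rough illustration of the point about top-level fields, the sketch below validates a preset and places `imageSize` beside `prompt` and `model` rather than inside `modelParams`. The preset names come from the list above and the field names from the example body in Agent and API notes; the `build_body` helper itself is hypothetical.

```python
IMAGE_SIZE_PRESETS = (
    "square_hd",
    "square",
    "portrait_4_3",
    "portrait_16_9",
    "landscape_4_3",
    "landscape_16_9",
)


def build_body(prompt, model, image_size, model_params=None):
    """Assemble a fal route body with imageSize as a top-level field."""
    if image_size not in IMAGE_SIZE_PRESETS:
        raise ValueError(f"unknown imageSize preset: {image_size!r}")
    body = {"prompt": prompt, "model": model, "imageSize": image_size}
    if model_params:
        # Endpoint-specific extras stay nested under modelParams.
        body["modelParams"] = dict(model_params)
    return body
```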

Outputs

On success the node emits the generated image on the image-out port and stores it as durable media so downstream nodes and later sessions can reuse it. If the prompt looks like JSON, HTML, or source code rather than a visual description, the node attaches a prompt-quality warning to the result instead of failing.
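
The page does not describe how the prompt-quality check decides that a prompt "looks like" JSON, HTML, or source code. The sketch below shows one plausible heuristic purely for illustration; the function name, the regular expressions, and the returned labels are all assumptions and should not be read as the node's actual rules.

```python
import json
import re

# Hypothetical patterns: an HTML-like tag, and a code keyword starting a line.
_HTML_TAG = re.compile(r"</?[a-zA-Z][^>]*>")
_CODE_LINE = re.compile(
    r"^\s*(def |class |import |function |const |#include)", re.MULTILINE
)


def prompt_quality_warning(prompt):
    """Return 'json', 'html', 'code', or None for a visual-looking prompt."""
    text = prompt.strip()
    if text.startswith(("{", "[")):
        try:
            json.loads(text)
            return "json"
        except ValueError:
            pass  # Not valid JSON; keep checking the other patterns.
    if _HTML_TAG.search(text):
        return "html"
    if _CODE_LINE.search(text):
        return "code"
    return None
```

A warning of this kind is attached to the result; the generation itself still runs.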

Run a generation

Add the node and set a prompt
Add an AI Image node, then type a prompt or connect a text source. The node opens its configuration panel automatically so you can pick a model and sizing.
Choose provider, model, and size
The provider preset determines the backend route. Select an aspect ratio or image size and any supported model parameters. Connect a reference image if the model accepts one.
Run the node
Google requests return inline. fal requests are submitted to the fal queue and polled to completion, which tolerates client disconnects during longer jobs.
Reuse the output
The stored image flows out of image-out. Wire it into a Video Generation node, an audio workflow, or an Image node for cropping and export.

Async and polling (fal)

The fal image route defaults to the asynchronous queue path. The Builder Studio executor submits the job, receives a request id and a signed status token, then suspends and polls the fal status endpoint until the image is ready. Google image generation runs inline within the request.
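
The submit-then-poll flow can be sketched as a loop over a status callback. This is a minimal illustration under assumptions: `get_status` is a caller-supplied function standing in for the status endpoint, and the `status` key and the `COMPLETED` and `FAILED` values are assumed names, not confirmed details of the executor.

```python
import time


def poll_until_done(get_status, request_id, status_token,
                    interval_s=1.0, timeout_s=300.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Poll a status callback until the job completes, fails, or times out."""
    deadline = clock() + timeout_s
    while True:
        # get_status is hypothetical: it takes the request id and the signed
        # status token returned at submit time and returns a dict.
        state = get_status(request_id, status_token)
        if state["status"] == "COMPLETED":
            return state
        if state["status"] == "FAILED":
            raise RuntimeError(f"generation failed for request {request_id}")
        if clock() >= deadline:
            raise TimeoutError(f"request {request_id} did not finish in time")
        sleep(interval_s)
```

Because the job state lives in the queue rather than in an open connection, a loop like this can be resumed with the same request id and token after a disconnect.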

Agent and API notes

Agents drive image generation by setting the node prompt and provider preset, then running the node. The example below is a direct fal route body. Reserved internal keys are stripped, unknown model parameters for the chosen endpoint are dropped, and the request is bound to a canvas for credential and access scoping.

```json
{
  "prompt": "a calm desert at golden hour, cinematic",
  "model": "fal-ai/flux-2",
  "imageSize": "landscape_16_9",
  "modelParams": {
    "guidance_scale": 3.5,
    "num_inference_steps": 28,
    "seed": 42
  }
}
```
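
For agents that call the route directly, the body above could be wrapped in an HTTP POST along these lines. The sketch only constructs the request and does not send it. The base URL is a placeholder, the `build_fal_request` helper is hypothetical, and authentication and canvas binding are left out because this page does not specify their field or header names.

```python
import json
import urllib.request

FAL_IMAGE_ROUTE = "/api/fal/generate-image"


def build_fal_request(base_url, body):
    """Build (but do not send) a JSON POST to the fal image route."""
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + FAL_IMAGE_ROUTE,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


example_body = {
    "prompt": "a calm desert at golden hour, cinematic",
    "model": "fal-ai/flux-2",
    "imageSize": "landscape_16_9",
    "modelParams": {"guidance_scale": 3.5, "num_inference_steps": 28, "seed": 42},
}
# Placeholder host; substitute the real Builder Studio origin.
request = build_fal_request("https://studio.example.invalid", example_body)
```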
