← Back to all generators

stability-ai/stable-diffusion

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

Capabilities

1:1 4:3 3:4 16:9 9:16 2:3 3:2 Negative Prompt Seed

Cost

Community model (estimated from hardware time)

Input Parameters

guidance_scale number

Scale for classifier-free guidance

Default: 7.5 min: 1, max: 20
height integer

Height of generated image in pixels. Needs to be a multiple of 64

Default: 768
64 128 192 256 320 384 448 512 576 640 704 768 832 896 960 1024
negative_prompt string

Specify things to not see in the output

num_inference_steps integer

Number of denoising steps

Default: 50 min: 1, max: 500
num_outputs integer

Number of images to generate.

Default: 1 min: 1, max: 4
prompt string

Input prompt

Default: "a vision of paradise. unreal engine"
scheduler string

Choose a scheduler.

Default: "DPMSolverMultistep"
DDIM K_EULER DPMSolverMultistep K_EULER_ANCESTRAL PNDM KLMS
seed integer

Random seed. Leave blank to randomize the seed

width integer

Width of generated image in pixels. Needs to be a multiple of 64

Default: 768
64 128 192 256 320 384 448 512 576 640 704 768 832 896 960 1024
Version: ac732df83cea Updated: 6/8/2026 110.9M runs