wan-video/wan-2.7-image-pro
Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
prompt * | string | Text prompt for image generation or editing | — | — |
image_set_mode | boolean | Generate a coherent set of related images from a single prompt (e.g. a character across seasons). When enabled, num_outputs can be up to 12. | false | — |
images | array | Input images for editing, style transfer, or multi-reference generation (up to 9 images, jpg/png/bmp/webp). When provided, the model operates in image editing mode. | | — |
num_outputs | integer | Number of images to generate (1-4 for standard mode, 1-12 for image set mode) | 1 | min: 1, max: 4 |
seed | integer | Random seed for reproducible generation. Range: 0-2147483647 | — | — |
size | string | Output image resolution. '1K', '2K', and '4K' auto-size based on input images. 4K is only available for text-to-image. | "2K" | 1K 2K 4K 1024*1024 2048*2048 4096*4096 1280*720 720*1280 2048*1152 1152*2048 4096*2304 2304*4096 1024*768 768*1024 2048*1536 1536*2048 4096*3072 3072*4096 |
thinking_mode | boolean | Enable enhanced reasoning for improved image quality. Only applies to text-to-image (no input images, no image set mode). Increases generation time. | true | — |
prompt required string Text prompt for image generation or editing
image_set_mode boolean Generate a coherent set of related images from a single prompt (e.g. a character across seasons). When enabled, num_outputs can be up to 12.
false images array Input images for editing, style transfer, or multi-reference generation (up to 9 images, jpg/png/bmp/webp). When provided, the model operates in image editing mode.
num_outputs integer Number of images to generate (1-4 for standard mode, 1-12 for image set mode)
1 min: 1, max: 4 seed integer Random seed for reproducible generation. Range: 0-2147483647
size string Output image resolution. '1K', '2K', and '4K' auto-size based on input images. 4K is only available for text-to-image.
"2K" thinking_mode boolean Enable enhanced reasoning for improved image quality. Only applies to text-to-image (no input images, no image set mode). Increases generation time.
true d880bad3fb10 Updated: 6/26/2026 88.5K runs
cinemasetfree.com