vidu/q3-pro
High-fidelity video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
| prompt * | string | Text prompt for video generation. Maximum 5000 characters. | — | — |
| aspect_ratio | string | Aspect ratio of the output video. Only used in text-to-video mode (ignored when images are provided). | "16:9" | 16:9, 9:16, 3:4, 4:3, 1:1 |
| audio | boolean | Whether to generate audio synchronized with the video (dialogue and sound effects). | true | — |
| duration | integer | Duration of the video in seconds. | 5 | min: 1, max: 16 |
| end_image | string (uri) | End frame image for the video. Must be used together with start_image for start-end-to-video mode. The aspect ratios of start and end images must be similar (ratio between 0.8 and 1.25). Supported formats: png, jpeg, jpg, webp. | — | — |
| resolution | string | Resolution of the output video. | "720p" | 540p, 720p, 1080p |
| seed | integer | Random seed. Set for reproducible generation. | — | — |
| start_image | string (uri) | Start frame image for the video. When provided without an end_image, the model runs in image-to-video mode. Supported formats: png, jpeg, jpg, webp. | — | — |
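
The constraints in the table can be checked on the client before a request is submitted. The following is a minimal sketch, not an official client: `build_input`, `infer_mode`, and `aspect_ratios_similar` are hypothetical helper names, and the reading of the "ratio between 0.8 and 1.25" rule as the start aspect ratio divided by the end aspect ratio is an assumption. Only the parameter names, defaults, and limits come from the table.

```python
from typing import Optional, Tuple

# Allowed values and limits, copied from the Input Parameters table.
ASPECT_RATIOS = {"16:9", "9:16", "3:4", "4:3", "1:1"}
RESOLUTIONS = {"540p", "720p", "1080p"}
MAX_PROMPT_CHARS = 5000


def build_input(
    prompt: str,
    start_image: Optional[str] = None,
    end_image: Optional[str] = None,
    aspect_ratio: str = "16:9",
    resolution: str = "720p",
    duration: int = 5,
    audio: bool = True,
    seed: Optional[int] = None,
) -> dict:
    """Hypothetical helper: validate parameters and return an input payload."""
    if not prompt or len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError("prompt is required and limited to 5000 characters")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution}")
    if not 1 <= duration <= 16:
        raise ValueError("duration must be between 1 and 16 seconds")
    if end_image is not None and start_image is None:
        raise ValueError("end_image must be used together with start_image")

    payload = {
        "prompt": prompt,
        "resolution": resolution,
        "duration": duration,
        "audio": audio,
    }
    if start_image is None:
        # aspect_ratio only applies in text-to-video mode.
        payload["aspect_ratio"] = aspect_ratio
    else:
        payload["start_image"] = start_image
        if end_image is not None:
            payload["end_image"] = end_image
    if seed is not None:
        payload["seed"] = seed
    return payload


def infer_mode(payload: dict) -> str:
    """Return the generation mode implied by which images are present."""
    if "end_image" in payload:
        return "start-end-to-video"
    if "start_image" in payload:
        return "image-to-video"
    return "text-to-video"


def aspect_ratios_similar(start_size: Tuple[int, int], end_size: Tuple[int, int]) -> bool:
    """Assumed reading of the rule: start AR / end AR must lie in [0.8, 1.25]."""
    start_ar = start_size[0] / start_size[1]
    end_ar = end_size[0] / end_size[1]
    return 0.8 <= start_ar / end_ar <= 1.25
```

For example, `build_input("a cat on a skateboard", duration=8)` yields a text-to-video payload that includes the default `aspect_ratio`, while passing both `start_image` and `end_image` yields a start-end-to-video payload with `aspect_ratio` omitted. Image sizes for `aspect_ratios_similar` are `(width, height)` pairs that the caller would obtain separately.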
1a8e2767ffcf | Updated: 6/26/2026 | 2.7K runs
cinemasetfree.com