alibaba/happyhorse-1.1
Alibaba's Happy Horse 1.1 generates videos from text, animates a single image, or builds a video from multiple reference images. Supports 720p and 1080p, 3-15 second durations, and five aspect ratios.
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
aspect_ratio | string | Aspect ratio of the generated video. Only applies to text-to-video and reference-to-video. When a single image is provided (image-to-video), the image's aspect ratio is used. | "16:9" | 16:9 9:16 1:1 4:3 3:4 |
duration | integer | Duration of the generated video in seconds | 5 | 3 4 5 6 7 8 9 10 11 12 13 14 15 |
images | array | Input images. Provide no images for text-to-video, one image to animate it as the first frame (image-to-video), or multiple images (up to 9) for reference-to-video. Accepts jpg/png/bmp/webp, <=10MB each, aspect ratio between 1:2.5 and 2.5:1, each side >=300px. | | — |
prompt | string | Text prompt for video generation. Optional when a single image is provided (image-to-video). Required for text-to-video and reference-to-video. When providing multiple reference images, refer to them in the prompt as [Image 1], [Image 2], etc. | "" | — |
resolution | string | Output video resolution | "1080p" | 720p 1080p |
seed | integer | Random seed for reproducible generation. Range: 0-2147483647 | — | — |
aspect_ratio string Aspect ratio of the generated video. Only applies to text-to-video and reference-to-video. When a single image is provided (image-to-video), the image's aspect ratio is used.
"16:9" duration integer Duration of the generated video in seconds
5 images array Input images. Provide no images for text-to-video, one image to animate it as the first frame (image-to-video), or multiple images (up to 9) for reference-to-video. Accepts jpg/png/bmp/webp, <=10MB each, aspect ratio between 1:2.5 and 2.5:1, each side >=300px.
prompt string Text prompt for video generation. Optional when a single image is provided (image-to-video). Required for text-to-video and reference-to-video. When providing multiple reference images, refer to them in the prompt as [Image 1], [Image 2], etc.
"" resolution string Output video resolution
"1080p" seed integer Random seed for reproducible generation. Range: 0-2147483647
57e8eee02a05 Updated: 6/26/2026 684 runs
cinemasetfree.com