zsxkib/pyramid-flow

Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching

Capabilities

Reference Images

Community model (estimated from hardware time)

Name	Type	Description	Default	Constraints
`prompt`*	string	Text prompt for video generation	`—`	—
`duration`	integer	Duration of the video in seconds (1-3 for canonical mode, 1-10 for non-canonical mode)	`3`	min: 1, max: 10
`frames_per_second`	integer	Frames per second (8 or 24, only applicable in canonical mode)	`8`	824
`guidance_scale`	number	Guidance Scale for text-to-video generation	`9`	min: 1, max: 15
`image`	string(uri)	Optional input image for image-to-video generation	`—`	—
`video_guidance_scale`	number	Video Guidance Scale	`5`	min: 1, max: 15

promptrequiredstring

Text prompt for video generation

durationinteger

Duration of the video in seconds (1-3 for canonical mode, 1-10 for non-canonical mode)

Default: 3min: 1, max: 10

frames_per_secondinteger

Frames per second (8 or 24, only applicable in canonical mode)

Default: 8

824

guidance_scalenumber

Guidance Scale for text-to-video generation

Default: 9min: 1, max: 15

imagestring

Optional input image for image-to-video generation

video_guidance_scalenumber

Video Guidance Scale

Default: 5min: 1, max: 15

Version: 8e221e66498aUpdated: 7/25/20269.2K runs