cjwbw/aniportrait-audio2vid

Audio-Driven Synthesis of Photorealistic Portrait Animations

Community model (estimated from hardware time)

Name	Type	Description	Default	Constraints
`audio`*	string(uri)	Input audio	`—`	—
`image`*	string(uri)	Input image	`—`	—
`fps`	integer	Frames per second in the output video	`30`	—
`guidance_scale`	number	Scale for classifier-free guidance	`3.5`	—
`height`	integer	Height of output video	`512`	—
`seed`	integer	Random seed. Leave blank to randomize the seed	`—`	—
`steps`	integer	Inference steps	`25`	—
`width`	integer	Width of output video	`512`	—
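
The parameter table above can be turned into a request payload along the following lines. This is a minimal sketch, not the model author's code: the helper names `build_input` and `run` are hypothetical, the defaults are copied from the table, and the call assumes the official `replicate` Python client with a `REPLICATE_API_TOKEN` set in the environment.

```python
from typing import Any, Dict, Optional

MODEL_NAME = "cjwbw/aniportrait-audio2vid"

# Defaults copied from the parameter table; `seed` has none (blank means random).
DEFAULTS: Dict[str, Any] = {
    "fps": 30,
    "guidance_scale": 3.5,
    "height": 512,
    "steps": 25,
    "width": 512,
}


def build_input(audio: str, image: str, seed: Optional[int] = None,
                **overrides: Any) -> Dict[str, Any]:
    """Assemble the input dict: two required URIs plus optional overrides."""
    if not audio or not image:
        raise ValueError("both 'audio' and 'image' are required")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    payload = {"audio": audio, "image": image, **DEFAULTS, **overrides}
    if seed is not None:
        # Only send a seed when the caller wants a reproducible run.
        payload["seed"] = seed
    return payload


def run(model_ref: str, payload: Dict[str, Any]) -> Any:
    """Submit the payload through the Replicate client (hypothetical wrapper)."""
    import replicate  # imported lazily so build_input works without the package
    return replicate.run(model_ref, input=payload)
```

Here `model_ref` is expected in `owner/name:version` form. The version hash shown on this page appears to be abbreviated, so the full identifier would need to be taken from the model's version listing. Omitting `seed` leaves the randomization to the model, as the table describes.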

audio (string, required)
Input audio

image (string, required)
Input image

fps (integer)
Frames per second in the output video
Default: 30

guidance_scale (number)
Scale for classifier-free guidance
Default: 3.5

height (integer)
Height of output video
Default: 512

seed (integer)
Random seed. Leave blank to randomize the seed

steps (integer)
Inference steps
Default: 25

width (integer)
Width of output video
Default: 512

Version: 3f976d8f2308 | Updated: 7/25/2026 | 14.9K runs