zsxkib/instant-id
Make realistic images of real people instantly
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
| image (required) | string (uri) | Input face image | — | — |
| canny_strength | number | Canny ControlNet strength; effective only if `enable_canny_controlnet` is true | 0.3 | min: 0, max: 1 |
| controlnet_conditioning_scale | number | Scale for IdentityNet strength (for fidelity) | 0.8 | min: 0, max: 1.5 |
| depth_strength | number | Depth ControlNet strength; effective only if `enable_depth_controlnet` is true | 0.5 | min: 0, max: 1 |
| disable_safety_checker | boolean | Disable the safety checker for generated images | false | — |
| enable_canny_controlnet | boolean | Enable Canny ControlNet; when false, `canny_strength` is ignored | false | — |
| enable_depth_controlnet | boolean | Enable Depth ControlNet; when false, `depth_strength` is ignored | false | — |
| enable_lcm | boolean | Enable fast inference with LCM (Latent Consistency Models). Speeds up inference at some cost in image quality; performs better with close-up portrait face images | false | — |
| enable_pose_controlnet | boolean | Enable OpenPose ControlNet; when false, `pose_strength` is ignored | true | — |
| enhance_nonface_region | boolean | Enhance non-face region | true | — |
| face_detection_input_height | integer | Height of the input image for face detection | 640 | min: 640, max: 4096 |
| face_detection_input_width | integer | Width of the input image for face detection | 640 | min: 640, max: 4096 |
| guidance_scale | number | Scale for classifier-free guidance | 7.5 | min: 1, max: 50 |
| ip_adapter_scale | number | Scale for image adapter strength (for detail) | 0.8 | min: 0, max: 1.5 |
| lcm_guidance_scale | number | Scale for classifier-free guidance when using LCM; only used when `enable_lcm` is true | 1.5 | min: 1, max: 20 |
| lcm_num_inference_steps | integer | Number of denoising steps when using LCM; only used when `enable_lcm` is true | 5 | min: 1, max: 10 |
| negative_prompt | string | Negative prompt | "" | — |
| num_inference_steps | integer | Number of denoising steps | 30 | min: 1, max: 500 |
| num_outputs | integer | Number of images to output | 1 | min: 1, max: 8 |
| output_format | string | Format of the output images | "webp" | webp, jpg, png |
| output_quality | integer | Quality of the output images, from 0 (lowest) to 100 (best) | 80 | min: 0, max: 100 |
| pose_image | string (uri) | (Optional) reference pose image | — | — |
| pose_strength | number | OpenPose ControlNet strength; effective only if `enable_pose_controlnet` is true | 0.4 | min: 0, max: 1 |
| prompt | string | Input prompt | "a person" | — |
| scheduler | string | Scheduler | "EulerDiscreteScheduler" | DEISMultistepScheduler, HeunDiscreteScheduler, EulerDiscreteScheduler, DPMSolverMultistepScheduler, DPMSolverMultistepScheduler-Karras, DPMSolverMultistepScheduler-Karras-SDE |
| sdxl_weights | string | Base weights to use | "stable-diffusion-xl-base-1.0" | stable-diffusion-xl-base-1.0, juggernaut-xl-v8, afrodite-xl-v2, albedobase-xl-20, albedobase-xl-v13, animagine-xl-30, anime-art-diffusion-xl, anime-illust-diffusion-xl, dreamshaper-xl, dynavision-xl-v0610, guofeng4-xl, nightvision-xl-0791, omnigen-xl, pony-diffusion-v6-xl, protovision-xl-high-fidel, RealVisXL_V3.0_Turbo, RealVisXL_V4.0_Lightning |
| seed | integer | Random seed. Leave blank to randomize the seed | — | — |
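
The constraints in the table can be checked on the client before a request is sent. The following is a minimal sketch, not part of the model itself: `RANGES`, `CHOICES`, and `build_input` are hypothetical names introduced here, and the numeric bounds are copied from the Constraints column. The commented call at the end assumes the Replicate Python client (`replicate.run`) and leaves the full version id as a placeholder, since only a shortened id appears on this page.

```python
from typing import Any

# Numeric bounds copied from the Constraints column (inclusive on both ends).
RANGES = {
    "canny_strength": (0, 1),
    "controlnet_conditioning_scale": (0, 1.5),
    "depth_strength": (0, 1),
    "face_detection_input_height": (640, 4096),
    "face_detection_input_width": (640, 4096),
    "guidance_scale": (1, 50),
    "ip_adapter_scale": (0, 1.5),
    "lcm_guidance_scale": (1, 20),
    "lcm_num_inference_steps": (1, 10),
    "num_inference_steps": (1, 500),
    "num_outputs": (1, 8),
    "output_quality": (0, 100),
    "pose_strength": (0, 1),
}

# Enumerated options; only output_format is listed here to keep the sketch short.
CHOICES = {"output_format": {"webp", "jpg", "png"}}


def build_input(image: str, **overrides: Any) -> dict[str, Any]:
    """Return an input payload, raising ValueError for out-of-range values."""
    for name, value in overrides.items():
        if name in RANGES:
            low, high = RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside [{low}, {high}]")
        elif name in CHOICES and value not in CHOICES[name]:
            raise ValueError(f"{name}={value!r} is not one of {sorted(CHOICES[name])}")
    # `image` is the only required field; everything else falls back to defaults.
    return {"image": image, **overrides}


payload = build_input(
    "https://example.com/face.jpg",  # placeholder URL
    prompt="a person",
    num_outputs=2,
    output_format="png",
)

# Assumed usage with the Replicate Python client (needs an API token):
# import replicate
# output = replicate.run("zsxkib/instant-id:<full version id>", input=payload)
```

Parameters that are omitted are left out of the payload, so the server-side defaults from the table apply.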
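
Several parameters only take effect when a companion flag is set. The sketch below illustrates one reading of those descriptions: each ControlNet strength applies only when its `enable_*_controlnet` flag is true, and the `lcm_*` values are used when `enable_lcm` is true. The page does not say that the LCM values replace `guidance_scale` and `num_inference_steps` in that case, so treat that as an assumption. `DEFAULTS` and `effective_settings` are hypothetical names; the defaults are copied from the parameter list.

```python
from typing import Any

# Defaults copied from the parameter list.
DEFAULTS = {
    "enable_lcm": False,
    "guidance_scale": 7.5,
    "num_inference_steps": 30,
    "lcm_guidance_scale": 1.5,
    "lcm_num_inference_steps": 5,
    "enable_canny_controlnet": False,
    "canny_strength": 0.3,
    "enable_depth_controlnet": False,
    "depth_strength": 0.5,
    "enable_pose_controlnet": True,
    "pose_strength": 0.4,
}


def effective_settings(user_input: dict[str, Any]) -> dict[str, Any]:
    """Resolve which step count, guidance scale, and ControlNets would apply."""
    cfg = {**DEFAULTS, **user_input}

    # Assumption: with LCM enabled, the lcm_* values are used in place of the regular ones.
    if cfg["enable_lcm"]:
        steps = cfg["lcm_num_inference_steps"]
        guidance = cfg["lcm_guidance_scale"]
    else:
        steps = cfg["num_inference_steps"]
        guidance = cfg["guidance_scale"]

    # A strength value matters only when its ControlNet is enabled.
    controlnets = {
        kind: cfg[f"{kind}_strength"]
        for kind in ("canny", "depth", "pose")
        if cfg[f"enable_{kind}_controlnet"]
    }
    return {"steps": steps, "guidance": guidance, "controlnets": controlnets}


default_run = effective_settings({})
lcm_run = effective_settings({"enable_lcm": True, "enable_canny_controlnet": True})
```

With no overrides, only the pose ControlNet is active, because `enable_pose_controlnet` is the only flag that defaults to true.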
2e4785a4d80d · Updated: 2/26/2026 · 1.0M runs