← Back to all generators

pharmapsychotic/clip-interrogator

The CLIP Interrogator is a prompt engineering tool that combines OpenAI's CLIP and Salesforce's BLIP to optimize text prompts to match a given image. Use the resulting prompts with text-to-image models like Stable Diffusion to create cool art!

Capabilities

Reference Images

Cost

Community model (estimated from hardware time)

Input Parameters

image required string

Input image

clip_model_name string

Choose ViT-L for Stable Diffusion 1, ViT-H for Stable Diffusion 2, or ViT-bigG for Stable Diffusion XL.

Default: "ViT-L-14/openai"
ViT-L-14/openai ViT-H-14/laion2b_s32b_b79k ViT-bigG-14/laion2b_s39b_b160k
mode string

Prompt mode (best takes 10-20 seconds, fast takes 1-2 seconds).

Default: "best"
best classic fast negative
Version: 8151e1c9f47e Updated: 2/26/2026 4.9M runs