AI Model Catalog

Browse all available AI models across text, image, video, audio, speech, and upscale categories. Compare capabilities, costs, and parameters.

1022 models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

NegSeedGuidanceStepsSchedulerW/HSafetyMulti

Community1033.2M runs

black-forest-labs/flux-schnell

The fastest image generation model tailored for local development and personal use

SeedStepsFormatSafetyMulti

$0.003/img619.6M runs

meta/meta-llama-3-8b-instruct

An 8 billion parameter language model from Meta, fine tuned for chat completions

Temp

Community400.2M runs

salesforce/blip

Generate image captions

Refs

Community171.9M runs

meta/meta-llama-3-70b-instruct

A 70 billion parameter language model from Meta, fine tuned for chat completions

Temp

Community165.6M runs

openai/whisper

Convert speech in audio to text

Temp

Community143.5M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Refs

Community111.0M runs

stability-ai/stable-diffusion

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

NegSeedGuidanceStepsSchedulerW/HMulti

Community110.9M runs

tencentarc/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Scale

Community109.4M runs

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Refs

Community90.1M runs

google/nano-banana

Google's latest image editing model in Gemini 2.5

RefsFormat

Community85.7M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

RefsScale

Community85.5M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMaskLoRA

Community84.0M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Voice

Community80.0M runs

black-forest-labs/flux-1.1-pro

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

RefsSeedFormatW/H

$0.040/img67.6M runs

meta/meta-llama-3-8b

Base version of Llama 3, an 8 billion parameter language model from Meta.

Temp

Community51.2M runs

sczhou/codeformer

Robust face restoration algorithm for old photos / AI-generated faces

Refs

Community50.4M runs

black-forest-labs/flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and...

RefsSeedFormat

Community46.0M runs

prunaai/z-image-turbo

Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.

SeedGuidanceStepsFormatW/H

Community44.8M runs

black-forest-labs/flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

RefsSeedGuidanceStepsFormatSafetyMulti

$0.025/img41.2M runs

prunaai/flux-fast

This is the fastest Flux endpoint in the world.

SeedGuidanceStepsFormat

Community38.9M runs

jagilley/controlnet-scribble

Generate detailed images from scribbled drawings

RefsNegSeedScale

Community38.3M runs

prunaai/p-image-edit

A sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image h...

SeedSafety

Community34.5M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

RefsTemp

Community34.0M runs

google/nano-banana-pro

Google's state of the art image generation and editing model 🍌🍌

RefsFormat

Community32.9M runs

bytedance/seedream-4.5

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

RefsW/HSafety

Community31.8M runs

andreasjansson/blip-2

Answers questions about images

RefsTemp

Community31.4M runs

openai/gpt-4o-mini

Low latency, low cost version of OpenAI's GPT-4o model

RefsTemp

Community30.5M runs

bytedance/seedream-4

Unified text-to-image generation and precise single-sentence editing at up to 4K resolution

RefsW/H

Community27.5M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

RefsNegSeedStepsSchedulerFormatMaskScale

Community27.1M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Community26.4M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

SeedGuidanceStepsFormatW/HSafetyMulti

Community21.6M runs

black-forest-labs/flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

RefsSeedFormat

$0.060/img20.9M runs

prunaai/flux-kontext-fast

Ultra fast flux kontext endpoint

SeedGuidanceStepsFormat

Community19.3M runs

nicolascoutureau/video-utils

No description available

FPS

Community19.1M runs

meta/llama-2-7b-chat

A 7 billion parameter language model from Meta, fine tuned for chat completions

SeedTemp

Community18.6M runs

851-labs/background-remover

Remove backgrounds from images.

Refs

Community16.5M runs

google/nano-banana-2

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

RefsFormat

Community15.3M runs

fofr/face-to-many

Turn a face into 3D, emoji, pixel art, video game, claymation or toy

RefsNegSeedLoRA

Community15.0M runs

lucataco/remove-bg

Remove background from an image

Refs

Community14.2M runs

black-forest-labs/flux-pro

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversi...

RefsSeedGuidanceStepsFormatW/H

$0.050/img13.9M runs

openai/gpt-image-2

OpenAI's state-of-the-art image generation model. Create and edit images from text with strong instruction following, sh...

RefsFormat

Community12.9M runs

prunaai/p-image

A sub 1 second text-to-image model built for production use cases.

SeedW/HSafetyLoRA

Community12.7M runs

minimax/speech-02-turbo

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-...

Voice

Community12.5M runs

datacte/proteus-v0.2

Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding t...

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMask

Community11.8M runs

fofr/sdxl-emoji

An SDXL fine-tune based on Apple Emojis

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMaskLoRA

Community11.8M runs

cjwbw/rembg

Remove images background

Refs

Community10.6M runs

ai-forever/kandinsky-2.2

multilingual text2image latent diffusion model

NegSeedStepsFormatW/HMulti

Community10.1M runs

meta/llama-2-70b-chat

A 70 billion parameter language model from Meta, fine tuned for chat completions

SeedTemp

Community10.1M runs

jagilley/controlnet-hough

Modify images using M-LSD line detection

RefsNegSeedScale

Community10.0M runs

black-forest-labs/flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transf...

RefsSeedFormat

Community9.8M runs

qwen/qwen-image-edit-plus

The latest Qwen-Image’s iteration with improved multi-image editing, single-image consistency, and native support for Co...

RefsSeedFormatSafety

Community9.6M runs

lucataco/codeformer

Robust face restoration algorithm for old photos/AI-generated faces

Refs

Community9.5M runs

alexgenovese/upscaler

GFPGAN aims at developing Practical Algorithms for Real-world Face and Object Restoration

RefsScale

Community8.8M runs

tencentarc/photomaker

Create photos, paintings and avatars for anyone in any style within seconds.

RefsNegSeedGuidanceSafetyMulti

Community8.8M runs

bytedance/hyper-flux-16step

Hyper FLUX 16-step by ByteDance

SeedGuidanceStepsFormatW/HSafetyMulti

Community8.7M runs

lucataco/moondream2

moondream2 is a small vision language model designed to run efficiently on edge devices

Refs

Community8.7M runs

thomasmol/whisper-diarization

⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo & pyannote 4.0 community-1 | word ...

Community8.5M runs

prunaai/hidream-l1-fast

This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!

NegSeedFormat

Community8.4M runs

google/gemini-2.5-flash

Google’s hybrid “thinking” AI model optimized for speed and cost-efficiency

Temp

Community8.2M runs

ideogram-ai/ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent...

RefsSeedMask

Community8.0M runs

recraft-ai/recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide...

$0.040/img7.9M runs

wan-video/wan-2.2-i2v-fast

A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video

RefsSeedSafetyFPS

Community7.8M runs

victor-upmeet/whisperx

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3

Temp

Community7.8M runs

comfyui/any-comfyui-workflow

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Format

Community7.8M runs

google/imagen-4

Google's Imagen 4 flagship model

Format

Community7.7M runs

black-forest-labs/flux-2-pro

High-quality image generation and editing with support for eight reference images

RefsSeedFormatW/H

Community7.6M runs

openai/clip

Official CLIP models, generate CLIP (clip-vit-large-patch14) text & image embeddings

Refs

Community7.1M runs

black-forest-labs/flux-kontext-dev

Open-weight version of FLUX.1 Kontext

RefsSeedGuidanceStepsFormatSafety

Community6.6M runs

jingyunliang/swinir

Image Restoration Using Swin Transformer

Refs

Community6.3M runs

ai-forever/kandinsky-2

text2img model trained on LAION HighRes and fine-tuned on internal datasets

SeedGuidanceStepsSchedulerFormatW/H

Community6.2M runs

usamaehsan/controlnet-1.1-x-realistic-vision-v2.0

controlnet 1.1 lineart x realistic-vision-v2.0 (updated to v5)

RefsNegSeedGuidanceSteps

Community5.8M runs

black-forest-labs/flux-dev-lora

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference

RefsSeedGuidanceStepsFormatSafetyMultiLoRA

Community5.7M runs

google/imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

Format

Community5.7M runs

smoretalk/rembg-enhance

A background removal model enhanced with better matting

Refs

Community5.4M runs

datacte/proteus-v0.3

ProteusV0.3: The Anime Update

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMask

Community5.4M runs

deepseek-ai/deepseek-v3

DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source

Temp

Community5.2M runs

google/gemini-3-flash

Google's most intelligent model built for speed with frontier intelligence, superior search, and grounding

Temp

Community5.1M runs

lucataco/xtts-v2

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

Community5.0M runs

yorickvp/llava-v1.6-mistral-7b

LLaVA v1.6: Large Language and Vision Assistant (Mistral-7B)

RefsTemp

Community5.0M runs

meta/llama-2-13b-chat

A 13 billion parameter language model from Meta, fine tuned for chat completions

SeedTemp

Community4.9M runs

pharmapsychotic/clip-interrogator

The CLIP Interrogator is a prompt engineering tool that combines OpenAI's CLIP and Salesforce's BLIP to optimize text pr...

Refs

Community4.9M runs

zsxkib/mmaudio

Add sound to video using the MMAudio V2 model. An advanced AI model that synthesizes high-quality audio from video conte...

RefsNegSeedDuration

Community4.8M runs

openai/gpt-5-nano

Fastest, most cost-effective GPT-5 model from OpenAI

Refs

Community4.8M runs

meta/llama-4-maverick-instruct

A 17 billion parameter model with 128 experts

Temp

Community4.7M runs

black-forest-labs/flux-2-klein-4b

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time ...

SeedFormatSafety

Community4.7M runs

men1scus/birefnet

Bilateral Reference for High-Resolution Dichotomous Image Segmentation (CAAI AIR 2024)

Refs

Community4.7M runs

lucataco/realistic-vision-v5.1

Implementation of Realistic Vision v5.1 with VAE

NegSeedGuidanceStepsSchedulerW/H

Community4.3M runs

black-forest-labs/flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, sea...

RefsSeedGuidanceStepsFormatMask

Community4.0M runs

qwen/qwen-image-edit-2511

An enhanced version over Qwen-Image-Edit-2509, featuring multiple improvements including notably better consistency

RefsSeedFormatSafety

Community4.0M runs

bytedance/pulid

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

NegSeedGuidanceFormat

Community3.9M runs

anthropic/claude-3.7-sonnet

The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)

Refs

Community3.8M runs

yorickvp/llava-v1.6-vicuna-13b

LLaVA v1.6: Large Language and Vision Assistant (Vicuna-13B)

RefsTemp

Community3.8M runs

black-forest-labs/flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

SeedStepsFormatSafetyMultiLoRA

Community3.7M runs

recraft-ai/recraft-crisp-upscale

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web us...

Refs

Community3.7M runs

meta/llama-4-scout-instruct

A 17 billion parameter model with 16 experts

Temp

Community3.6M runs

mv-lab/swin2sr

3.5 Million Runs! AI Photorealistic Image Super-Resolution and Restoration

Refs

Community3.6M runs

lucataco/sdxl-controlnet

SDXL ControlNet - Canny

RefsNegSeedSteps

Community3.5M runs

kwaivgi/kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

NegDurationImg2Vid

Community3.4M runs

bytedance/seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p res...

RefsSeedDurationImg2VidFPS

Community3.4M runs

meta/musicgen

Generate music from a prompt or melody

SeedFormatDurationTemp

Community3.3M runs

openai/gpt-image-1.5

OpenAI's latest image generation model with better instruction following and adherence to prompts

RefsFormat

Community3.3M runs

cjwbw/anything-v4.0

high-quality, highly detailed anime-style Stable Diffusion models

NegSeedGuidanceStepsSchedulerW/HMulti

Community3.3M runs

bytedance/seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

SeedGuidanceW/H

Community3.3M runs

luma/photon

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

Seed

$0.040/img3.2M runs

black-forest-labs/flux-krea-dev

An opinionated text-to-image model from Black Forest Labs in collaboration with Krea that excels in photorealism. Create...

RefsSeedGuidanceStepsFormatSafetyMulti

Community3.0M runs

anthropic/claude-3.5-haiku

Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)

Community3.0M runs

fofr/flux-black-light

A flux lora fine-tuned on black light images

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.0M runs

playgroundai/playground-v2.5-1024px-aesthetic

Playground v2.5 is the state-of-the-art open-source model in aesthetic quality

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMask

Community3.0M runs

ideogram-ai/ideogram-v2-turbo

A fast image model with state of the art inpainting, prompt comprehension and text rendering.

RefsNegSeedMask

$0.050/img2.9M runs

black-forest-labs/flux-2-max

The highest fidelity image model from Black Forest Labs

RefsSeedFormatW/H

Community2.7M runs

cjwbw/real-esrgan

Real-ESRGAN: Real-World Blind Super-Resolution

Refs

Community2.7M runs

bytedance/seedance-1.5-pro

A joint audio-video model that accurately follows complex instructions.

RefsSeedDurationImg2VidFPS

Community2.7M runs

ideogram-ai/ideogram-v2

An excellent image model with state of the art inpainting, prompt comprehension and text rendering

RefsNegSeedMask

$0.080/img2.7M runs

kwaivgi/kling-v2.5-turbo-pro

Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and...

RefsNegGuidanceDurationImg2Vid

Community2.7M runs

methexis-inc/img2prompt

Get an approximate text prompt, with style, matching an image. (Optimized for stable-diffusion (clip ViT-L/14))

Refs

Community2.7M runs

minimax/image-01

Minimax's first image model, with character reference support

Community2.6M runs

bytedance/flux-pulid

⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭

NegSeedGuidanceFormatW/HMulti

Community2.5M runs

minimax/speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high...

Voice

Community2.4M runs

prunaai/p-video

Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to...

RefsSeedDurationImg2VidFPS

Community2.4M runs

flux-kontext-apps/multi-image-kontext-pro

An experimental model with FLUX Kontext Pro that can combine two input images

SeedFormat

Community2.4M runs

bytedance/seedream-5-lite

Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge

RefsFormat

Community2.4M runs

tstramer/material-diffusion

Stable diffusion fork for generating tileable outputs using v1.5 model

SeedGuidanceStepsSchedulerW/HMultiMask

Community2.4M runs

anthropic/claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisel...

Refs

Community2.3M runs

deepseek-ai/deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

Temp

Community2.3M runs

ideogram-ai/ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

RefsSeedMask

Community2.1M runs

sdxl-based/realvisxl-v3-multi-controlnet-lora

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMaskLoRA

Community2.0M runs

ideogram-ai/ideogram-v2a

Like Ideogram v2, but faster and cheaper

Seed

Community2.0M runs

snowflake/snowflake-arctic-instruct

An efficient, intelligent, and truly open-source language model

Community2.0M runs

fofr/sticker-maker

Make stickers with AI. Generates graphics with transparent backgrounds.

NegSeedStepsFormatW/H

Community2.0M runs

google/imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

Format

Community2.0M runs

stability-ai/stable-diffusion-3.5-large

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and p...

RefsNegSeedGuidanceFormat

Community2.0M runs

piddnad/ddcolor

Towards Photo-Realistic Image Colorization via Dual Decoders

Refs

Community1.9M runs

mistralai/mistral-7b-v0.1

A 7 billion parameter language model from Mistral.

SeedTemp

Community1.9M runs

adirik/interior-design

Realistic interior design with text and image inputs

RefsNegSeedGuidanceSteps

Community1.9M runs

stability-ai/stable-diffusion-3

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and ...

RefsNegSeedGuidanceStepsFormat

Community1.9M runs

daanelson/minigpt-4

A model which generates text in response to an input image and prompt.

RefsTemp

Community1.8M runs

rmokady/clip_prefix_caption

Simple image captioning model using CLIP and GPT-2

Refs

Community1.7M runs

codeplugtech/face-swap

Advance Face Swap powered by pixalto.app

Refs

Community1.7M runs

google/imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

Format

Community1.7M runs

ibm-granite/granite-3.3-8b-instruct

Granite-3.3-8B-Instruct is a 8-billion parameter 128K context length language model fine-tuned for improved reasoning an...

SeedTemp

Community1.7M runs

fofr/face-to-sticker

Turn a face into a sticker

RefsNegSeedStepsW/H

Community1.7M runs

qwen/qwen-image-edit

Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, e...

RefsSeedFormatSafety

Community1.6M runs

openai/gpt-4.1-mini

Fast, affordable version of GPT-4.1

RefsTemp

Community1.6M runs

bytedance/seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p r...

RefsSeedDurationImg2VidFPS

Community1.6M runs

pollinations/modnet

A deep learning approach to remove background & adding new background image

Refs

Community1.6M runs

black-forest-labs/flux-2-klein-9b

4 step distilled version of FLUX.2 [klein]. A foundation model for maximum flexibility and control

SeedFormatSafety

Community1.6M runs

tencentarc/photomaker-style

Create photos, paintings and avatars for anyone in any style within seconds. (Stylization version)

RefsNegSeedGuidanceSafetyMulti

Community1.6M runs

openai/gpt-image-1

A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to ...

RefsFormat

Community1.6M runs

qwen/qwen-image

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.

RefsNegSeedGuidanceStepsFormatSafetyLoRA

Community1.6M runs

zsxkib/realistic-voice-cloning

Create song covers with any RVC v2 trained AI voice from audio files.

Format

Community1.5M runs

fofr/latent-consistency-model

Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet

RefsSeedGuidanceStepsW/HSafetyMulti

Community1.5M runs

kwaivgi/kling-v1.6-standard

Generate 5s and 10s videos in 720p resolution at 30fps

RefsNegGuidanceDurationImg2Vid

Community1.5M runs

black-forest-labs/flux-fill-dev

Open-weight inpainting model for editing and extending images. Guidance-distilled from FLUX.1 Fill [pro].

RefsSeedGuidanceStepsFormatSafetyMultiMaskLoRA

Community1.5M runs

pseudoram/rvc-v2

Speech to speech with any RVC v2 trained AI voice

Format

Community1.5M runs

megvii-research/nafnet

Nonlinear Activation Free Network for Image Restoration

Refs

Community1.5M runs

topazlabs/image-upscale

Professional-grade image upscaling, from Topaz Labs

RefsFormat

Community1.4M runs

zsxkib/blip-3

Blip 3 / XGen-MM, Answers questions about images ({blip3,xgen-mm}-phi3-mini-base-r-v1)

RefsTemp

Community1.3M runs

zsxkib/molmo-7b

allenai/Molmo-7B-D-0924, Answers questions and caption about images

RefsTemp

Community1.3M runs

google/gemini-3-pro

Google's most advanced reasoning Gemini model

Temp

Community1.3M runs

orpatashnik/styleclip

Text-Driven Manipulation of StyleGAN Imagery

Community1.3M runs

black-forest-labs/flux-2-dev

Quality image generation and editing with support for reference images

RefsSeedFormatW/HSafety

Community1.2M runs

openai/gpt-5

OpenAI's new model excelling at coding, writing, and reasoning.

Refs

$0.003/img1.2M runs

google/gemini-2.5-flash-image

Google's latest image generation model in Gemini 2.5

RefsFormat

Community1.2M runs

xai/grok-imagine-video

Generate videos using xAI's Grok Imagine Video model

RefsDuration

Community1.2M runs

microsoft/bringing-old-photos-back-to-life

Bringing Old Photos Back to Life

Refs

Community1.2M runs

openai/gpt-4.1-nano

Fastest, most cost-effective GPT-4.1 model from OpenAI

RefsTemp

Community1.2M runs

google/gemini-3.1-pro

Google's most intelligent model, with improved reasoning and a new medium thinking level

Temp

Community1.2M runs

openai/gpt-5-mini

Faster version of OpenAI's flagship GPT-5 model

Refs

$0.004/img1.2M runs

nvidia/sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

SeedGuidanceFormatW/H

Community1.1M runs

black-forest-labs/flux-depth-dev

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

SeedGuidanceStepsFormatSafetyMulti

Community1.1M runs

philz1337x/controlnet-deliberate

Modify images with canny edge detection and Deliberate model twitter: @philz1337x

RefsNegSeedScale

Community1.1M runs

qwen/qwen3-235b-a22b-instruct-2507

Updated Qwen3 model for instruction following

Temp

Community1.1M runs

prunaai/wan-2.2-image

This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through...

SeedFormat

Community1.1M runs

riffusion/riffusion

Stable diffusion for real-time music generation

Steps

Community1.1M runs

recraft-ai/recraft-remove-background

Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows

Refs

Community1.1M runs

lucataco/ssd-1b

Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintai...

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMaskLoRA

Community1.0M runs

minimax/speech-2.6-turbo

Low‑latency MiniMax Speech 2.6 Turbo brings multilingual, emotional text-to-speech to Replicate with 300+ voices and rea...

Voice

Community1.0M runs

zsxkib/instant-id

Make realistic images of real people instantly

RefsNegSeedGuidanceStepsSchedulerFormatSafetyMulti

Community1.0M runs

kwaivgi/kling-v2.6-motion-control

Enables precise control of character actions and expressions from a reference image.

Refs

Community1.0M runs

fermatresearch/sdxl-controlnet-lora

'''Last update: Now supports img2img.''' SDXL Canny controlnet with LoRA support.

RefsNegSeedGuidanceStepsSchedulerMultiLoRA

Community997.7K runs

fermatresearch/magic-image-refiner

A better alternative to SDXL refiners, providing a lot of quality and detail. Can also be used for inpainting or upscali...

RefsNegSeedGuidanceStepsSchedulerMask

Community950.6K runs

flux-kontext-apps/restore-image

Use FLUX Kontext to restore, fix scratches and damage, and colorize old photos

RefsSeedFormat

Community948.7K runs

stability-ai/stable-diffusion-3.5-large-turbo

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and p...

RefsNegSeedGuidanceFormat

Community931.9K runs

lucataco/hotshot-xl

😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL

NegSeedStepsSchedulerW/H

Community909.0K runs

lucataco/frame-extractor

Extract the first or last frame from any video file as a high-quality image

Community890.3K runs

bytedance/seedance-2.0

ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration c...

RefsSeedDurationImg2Vid

Community872.5K runs

meta/meta-llama-3-70b

Base version of Llama 3, a 70 billion parameter language model from Meta.

Temp

Community855.7K runs

lucataco/sdxl-clip-interrogator

CLIP Interrogator for SDXL optimizes text prompts to match a given image

Refs

Community848.7K runs

jagilley/controlnet-canny

Modify images using canny edge detection

RefsNegSeedScale

Community835.4K runs

topazlabs/video-upscale

Video Upscaling from Topaz Labs

Community831.1K runs

lucataco/qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible int...

Refs

Community825.7K runs

bytedance/seedance-1-pro-fast

A faster and cheaper version of Seedance 1 Pro

RefsSeedDurationFPS

Community822.5K runs

kwaivgi/kling-v1.6-pro

Generate 5s and 10s videos in 1080p resolution

RefsNegGuidanceDurationImg2Vid

Community812.1K runs

fofr/become-image

Adapt any picture of a face into another image

RefsNegSeedSafety

Community796.8K runs

zylim0702/qr_code_controlnet

ControlNet QR Code Generator: Simplify QR code creation for various needs using ControlNet's user-friendly neural interf...

NegSeedGuidanceStepsSchedulerMulti

Community796.4K runs

ibm-granite/granite-3.1-8b-instruct

Granite-3.1-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction followin...

Temp

Community778.1K runs

pixverse/pixverse-v5

Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anim...

RefsNegSeedDurationImg2Vid

Community769.6K runs

prunaai/p-image-upscale

Fastest image upscaler in the world (<1s) supporting outputs up to 128 MP. contact us for dedicated endpoints.

RefsFormatSafety

Community746.7K runs

qwen/qwen-edit-multiangle

Camera-aware edits for Qwen/Qwen-Image-Edit-2509 with Lightning + multi-angle LoRA

RefsSeedStepsFormatSafetyLoRA

Community724.4K runs

runwayml/gen4-image

Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture eve...

RefsSeed

Community716.0K runs

qwen/qwen3-tts

A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design

Community712.4K runs

jagilley/controlnet-depth2img

Modify images using depth maps

RefsNegSeedScale

Community680.2K runs

usamaehsan/controlnet-x-ip-adapter-realistic-vision-v5

Inpainting || multi-controlnet || single-controlnet || ip-adapter || ip adapter face || ip adapter plus || No ip adapter

NegSeedGuidanceStepsSchedulerMulti

Community674.7K runs

chenxwh/cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

Temp

Community672.6K runs

wan-video/wan-2.2-5b-fast

The fastest Wan 2.2 text-to-image and image-to-video model

RefsSeedSafetyFPS

Community672.6K runs

google/veo-3.1-fast

New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support

RefsNegSeedDuration

Community670.7K runs

minimax/video-01

Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a charac...

Img2Vid

Community663.7K runs

meta/llama-2-7b

Base version of Llama 2 7B, a 7 billion parameter language model

SeedTemp

Community659.8K runs

firtoz/trellis

A powerful 3D asset generation model

Seed

Community659.4K runs

fermatresearch/high-resolution-controlnet-tile

UPDATE: new upscaling algorithm for a much improved image quality. Fermat.app open-source implementation of an efficient...

RefsNegSeedGuidanceStepsScheduler

Community651.7K runs

anthropic/claude-4.5-sonnet

Claude Sonnet 4.5 is the best coding model to date, with significant improvements across the entire development lifecycl...

Refs

Community647.2K runs

cjwbw/bigcolor

Colorization using a Generative Color Prior for Natural Images

Refs

Community637.7K runs

jagilley/controlnet-hed

Modify images using HED maps

RefsNegSeedScale

Community622.5K runs

openai/gpt-oss-20b

20b open-weight language model from OpenAI

Temp

Community621.9K runs

google/upscaler

Upscale images 2x or 4x times

Refs

Community619.6K runs

adirik/realvisxl-v3.0-turbo

Photorealism with RealVisXL V3.0 Turbo based on SDXL

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMask

Community615.3K runs

aaronaftab/mirage-ghibli

Ghiblify any image, 10x cheaper/faster than GPT 4o

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community609.9K runs

openai/gpt-image-1-mini

A cost-efficient version of GPT Image 1

RefsFormat

Community608.4K runs

tencentarc/vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Refs

Community601.9K runs

recraft-ai/recraft-vectorize

Convert raster images to high-quality SVG format with precision and clean vector paths, perfect for logos, icons, and sc...

Refs

Community592.4K runs

yoyo-nb/thin-plate-spline-motion-model

Thin-Plate Spline Motion Model for Image Animation

Community592.4K runs

rafaelgalle/whisper-diarization-advanced

Ultra-fast, customizable speech-to-text and speaker diarization for noisy, multi-speaker audio. Includes advanced noise ...

Community579.5K runs

google-research/maxim

Multi-Axis MLP for Image Processing

Refs

Community568.6K runs

google/imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

Format

Community565.9K runs

ibm-granite/granite-8b-code-instruct-128k

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of u...

SeedTemp

Community556.2K runs

pnyompen/sd-controlnet-lora

SD1.5 Canny controlnet with LoRA support.

RefsNegSeedGuidanceStepsSchedulerMultiLoRA

Community548.9K runs

kwaivgi/kling-v3-omni-video

Kling Video 3.0 Omni: Unified multimodal video generation with reference images, video editing, native audio, and multi-...

RefsDurationImg2Vid

Community535.0K runs

ideogram-ai/ideogram-character

Generate consistent characters from a single reference image. Outputs can be in many styles. You can also use inpainting...

RefsSeedMask

Community534.9K runs

andreasjansson/tile-morph

Create tileable animations with seamless transitions

GuidanceStepsW/HFPS

Community529.4K runs

openai/gpt-5-structured

GPT-5 with support for structured outputs, web search and custom tools

Refs

Community528.8K runs

openai/gpt-4o

OpenAI's high-intelligence chat model

RefsTemp

Community526.4K runs

minimax/music-01

Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

Voice

Community513.6K runs

philz1337x/crystal-upscaler

High-precision image upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity A...

RefsFormatScale

Community507.5K runs

google/veo-3.1

New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame suppo...

RefsNegSeedDuration

Community504.4K runs

bria/remove-background

Bria AI's remove background model

Refs

Community498.1K runs

cjwbw/rudalle-sr

Real-ESRGAN super-resolution model from ruDALL-E

RefsScale

Community486.1K runs

deepseek-ai/deepseek-v3.1

Latest hybrid thinking model from Deepseek

Temp

Community485.6K runs

arielreplicate/deoldify_image

Add colours to old images

Refs

Community473.3K runs

replicate/train-rvc-model

Train your own custom RVC model

Community466.8K runs

ibm-granite/granite-3.2-8b-instruct

Granite-3.2-8B-Instruct is a 8-billion parameter 128K context length language model fine-tuned for reasoning and instruc...

Temp

Community460.4K runs

codeplugtech/background_remover

Remove background from image

Refs

Community451.3K runs

wavespeedai/wan-2.1-i2v-480p

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pus...

RefsNegSeedStepsSafetyLoRA

Community438.7K runs

qwen/qwen-image-edit-plus-lora

Qwen Image Edit 2509 LoRA explorer, uses HuggingFace URLs to load any safetensor

RefsSeedFormatSafetyLoRA

Community436.2K runs

meta/llama-2-70b

Base version of Llama 2, a 70 billion parameter language model from Meta.

SeedTemp

Community426.8K runs

ibm-granite/granite-3.0-2b-instruct

Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following t...

Temp

Community420.3K runs

black-forest-labs/flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

SeedGuidanceStepsFormat

Community417.8K runs

andreasjansson/illusion

Monster Labs' control_v1p_sd15_qrcode_monster ControlNet on top of SD 1.5

RefsNegSeedGuidanceStepsW/HMulti

Community409.3K runs

openai/o4-mini

OpenAI's fast, lightweight reasoning model

Refs

Community405.1K runs

ideogram-ai/ideogram-v3-balanced

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styl...

RefsSeedMask

Community391.2K runs

ideogram-ai/ideogram-v2a-turbo

Like Ideogram v2 turbo, but now faster and cheaper

Seed

Community383.1K runs

recraft-ai/recraft-v3-svg

Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images incl...

$0.040/img370.3K runs

lucataco/qwen2-vl-7b-instruct

Latest model in the Qwen family for chatting with video and image models

Community365.2K runs

bria/expand-image

Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand t...

RefsNegSeed

Community350.7K runs

minimax/hailuo-02

Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). I...

DurationImg2Vid

Community335.2K runs

recraft-ai/recraft-20b

Affordable and fast images

Community333.7K runs

jagilley/controlnet-normal

Modify images using normal maps

RefsNegSeedScale

Community330.8K runs

lucataco/animate-diff

Animate Your Personalized Text-to-Image Diffusion Models

NegSeedGuidanceSteps

Community329.0K runs

lucataco/real-esrgan-video

Real-ESRGAN Video Upscaler

Community312.7K runs

kwaivgi/kling-v3-motion-control

Kling 3.0 motion control: transfer motion from a reference video to any character image with improved consistency and qu...

Refs

Community309.9K runs

luma/reframe-image

Change the aspect ratio of any photo using AI (not cropping)

Refs

Community308.2K runs

suno-ai/bark

🔊 Text-Prompted Generative Audio Model

Community305.3K runs

black-forest-labs/flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

SeedGuidanceStepsFormatSafetyMulti

Community303.1K runs

black-forest-labs/flux-2-flex

Max-quality image generation and editing with support for ten reference images

RefsSeedGuidanceStepsFormatW/H

Community303.1K runs

bria/eraser

SOTA Object removal, enables precise removal of unwanted objects from images while maintaining high-quality outputs. Tra...

RefsMask

Community302.9K runs

anotherjesse/zeroscope-v2-xl

Zeroscope V2 XL & 576w

NegSeedGuidanceStepsW/HFPS

Community302.7K runs

luma/photon-flash

Accelerated variant of Photon prioritizing speed while maintaining quality

Seed

$0.025/img298.0K runs

black-forest-labs/flux-depth-pro

Professional depth-aware image generation. Edit images while preserving spatial relationships.

SeedGuidanceStepsFormat

Community297.8K runs

j-min/clip-caption-reward

Fine-grained Image Captioning with CLIP Reward

Refs

Community296.1K runs

codeslake/ifan-defocus-deblur

Removes defocus blur in an image

Refs

Community285.9K runs

openai/gpt-5.2

The best model for coding and agentic tasks across industries

Refs

Community285.3K runs

pixverse/lipsync

Generate realistic lipsync animations from audio for high-quality synchronization

Community282.9K runs

openai/gpt-4.1

OpenAI's Flagship GPT model for complex tasks.

RefsTemp

Community282.7K runs

leonardoai/lucid-origin

Artistic and high-quality visuals with improved prompt adherence, diversity, and definition

Multi

Community280.9K runs

bytedance/bagel

🥯ByteDance Seed's Bagel Unified multimodal AI that generates images, edits images, and understands images in one 7B par...

RefsSeedStepsFormat

Community272.8K runs

deforum/deforum_stable_diffusion

Animating prompts with stable diffusion

SeedFPS

Community266.7K runs

black-forest-labs/flux-kontext-dev-lora

FLUX.1 Kontext[dev] image editing model for running lora finetunes

RefsSeedGuidanceStepsFormatSafetyLoRA

Community265.5K runs

kwaivgi/kling-v3-video

Kling Video 3.0: Generate cinematic videos up to 15 seconds with multi-shot control, native audio, and improved consiste...

NegDurationImg2Vid

Community262.5K runs

joehoover/instructblip-vicuna13b

An instruction-tuned multi-modal model based on BLIP-2 and Vicuna-13B

Community257.5K runs

runwayml/gen4-aleph

A new way to edit, transform and generate video

RefsSeed

Community248.9K runs

ibm-granite/granite-vision-3.3-2b

Granite-vision-3.3-2b is a compact and efficient vision-language model, specifically designed for visual document unders...

RefsSeedTemp

Community247.9K runs

pixverse/pixverse-v4.5

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex action...

RefsNegSeedDurationImg2Vid

Community246.8K runs

openai/dall-e-3

An AI system that can create realistic images and art from a description in natural language.

Community246.6K runs

nvidia/sana

A fast image model with wide artistic range and resolutions up to 4096x4096

NegSeedGuidanceStepsW/H

Community238.9K runs

openai/gpt-5.1

The best model for coding and agentic tasks with configurable reasoning effort.

Refs

Community236.7K runs

shreejalmaharjan-27/tiktok-short-captions

Generate Tiktok-Style Captions powered by Whisper (GPU)

Temp

Community232.4K runs

openai/gpt-oss-120b

120b open-weight language model from OpenAI

Temp

Community230.2K runs

lucataco/dreamshaper-xl-turbo

DreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed t...

NegSeedGuidanceStepsSchedulerW/HSafetyMulti

Community229.7K runs

ibm-granite/granite-4.0-h-small

Granite-4.0-H-Small is a 32B parameter long-context instruct model finetuned from Granite-4.0-H-Small-Base using a combi...

SeedTemp

Community228.6K runs

openai/sora-2

OpenAI's Flagship video generation with synced audio

Community228.5K runs

resemble-ai/chatterbox

Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-...

SeedTemp

Community228.2K runs

jingyunliang/hcflow-sr

Image Super-Resolution

Refs

Community223.0K runs

flux-kontext-apps/multi-image-kontext-max

An experimental FLUX Kontext model that can combine two input images

SeedFormat

Community222.6K runs

google/veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

RefsNegSeedDuration

Community220.2K runs

xinntao/esrgan

Image 4x super-resolution

Refs

Community218.1K runs

fofr/sdxl-multi-controlnet-lora

Multi-controlnet, lora loading, img2img, inpainting

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMaskLoRA

Community217.9K runs

fofr/color-matcher

Color match and white balance fixes for images

Refs

Community217.0K runs

meta/llama-2-13b

Base version of Llama 2 13B, a 13 billion parameter language model

SeedTemp

Community209.4K runs

bytedance/seedance-2.0-fast

A faster variant of Seedance 2.0 for quicker video generation with multimodal inputs and native audio.

RefsSeedDurationImg2Vid

Community209.0K runs

black-forest-labs/flux-canny-dev

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

SeedGuidanceStepsFormatSafetyMulti

Community208.7K runs

google/lyria-3

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

Seed

Community208.2K runs

google/gemini-3.1-flash-tts

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support

Voice

Community205.2K runs

minimax/speech-2.8-turbo

Minimax Speech 2.8 Turbo: Turn text into natural, expressive speech with voice cloning, emotion control, and support for...

Voice

Community194.8K runs

wan-video/wan-2.5-i2v

Alibaba Wan 2.5 Image to video generation with background audio

RefsNegSeedDuration

Community194.1K runs

wan-video/wan-2.2-t2v-fast

A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video

SeedSafetyFPS

Community190.9K runs

cjwbw/supir

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

RefsNegSeed

Community190.5K runs

awerks/neon-tts

NeonAI Coqui AI TTS Plugin.

Community187.5K runs

minimax/speech-2.6-hd

MiniMax Speech 2.6 HD delivers studio-quality multilingual text-to-audio on Replicate with nuanced prosody, subtitle exp...

Voice

Community187.2K runs

wavespeedai/wan-2.1-t2v-480p

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that push...

NegSeedStepsSafetyLoRA

Community184.6K runs

prunaai/p-image-edit-lora

Use trained LoRAs from the https://replicate.com/prunaai/p-image-edit-trainer. Find or contribute LoRAs here: https://hu...

SeedSafetyLoRA

Community183.2K runs

kwaivgi/kling-v2.6

Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation

NegDurationImg2Vid

Community181.6K runs

ibm-granite/granite-3.0-8b-instruct

Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction followin...

Temp

Community181.4K runs

flux-kontext-apps/change-haircut

Quickly change someone's hair style and hair color, powered by FLUX.1 Kontext [pro]

RefsSeedFormat

Community179.2K runs

flux-kontext-apps/multi-image-list

FLUX Kontext max with list input for multiple images

RefsSeedFormat

Community176.2K runs

minimax/video-01-live

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases

Img2Vid

Community175.9K runs

jagilley/controlnet-pose

Modify images with humans using pose detection

RefsNegSeedScale

Community175.6K runs

afiaka87/tortoise-tts

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Seed

Community173.3K runs

google/veo-3-fast

A faster and cheaper version of Google’s Veo 3 video model, with audio

RefsNegSeedDuration

Community168.8K runs

jagilley/controlnet-seg

Modify images using semantic segmentation

RefsNegSeedScale

Community166.8K runs

fewjative/ultimate-sd-upscale

Ultimate SD Upscale with ControlNet Tile

RefsNegSeedGuidanceStepsScheduler

Community165.8K runs

lightricks/ltx-video

LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produ...

RefsNegSeedGuidanceSteps

Community165.3K runs

yangxy/gpen

Blind Face Restoration in the Wild

Refs

Community165.0K runs

bria/image-3.2

Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional a...

NegSeedGuidance

Community163.9K runs

cjwbw/sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

Community159.4K runs

cjwbw/damo-text-to-video

Multi-stage text-to-video generation

SeedStepsFPS

Community157.0K runs

cjwbw/videocrafter

VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing

Seed

Community154.3K runs

bytedance/omni-human

Turns your audio/video/images into professional-quality animated videos

Refs

Community153.5K runs

replicate/flan-t5-xl

A language model by Google for tasks like classification, summarization, and more

Temp

Community151.3K runs

flux-kontext-apps/cartoonify

Turn your image into a cartoon with FLUX.1 Kontext [pro]

RefsSeedFormat

Community150.8K runs

appmeloncreator/platmoji-beta

This is an emoji generator fine tuned with Flux. (btw thx so much for the support on this)

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community143.2K runs

stability-ai/stablelm-tuned-alpha-7b

7 billion parameter version of Stability AI's language model

Temp

Community140.6K runs

inworld/realtime-tts-1.5-max

Highest-quality realtime text-to-speech with <200ms latency, emotion control, and 15-language support

TempVoice

Community140.6K runs

cjwbw/vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Refs

Community140.6K runs

resemble-ai/chatterbox-turbo

The fastest open source TTS model without sacrificing quality.

SeedTempVoice

Community138.4K runs

zsxkib/diffbir

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

SeedGuidanceSteps

Community138.3K runs

qwen/qwen-image-2512

Qwen Image 2512 is an improved version of Qwen Image with more realistic human generation, finer textures, and stronger ...

RefsNegSeedGuidanceStepsFormatW/HSafety

Community137.1K runs

google-deepmind/gemma-2b-it

2B instruct version of Google’s Gemma model

Temp

Community134.4K runs

adirik/flux-cinestill

Flux lora, use "CNSTLL" to trigger

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community133.1K runs

lucataco/open-dalle-v1.1

A unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base...

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMask

Community132.7K runs

adirik/styletts2

Generates speech from text

Seed

Community132.5K runs

lucataco/florence-2-base

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Refs

Community132.3K runs

ali-vilab/i2vgen-xl

RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

RefsSeedGuidanceSteps

Community128.3K runs

lucataco/apollo-7b

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models

Temp

Community124.7K runs

flux-kontext-apps/text-removal

Remove all text from an image with FLUX.1 Kontext

RefsSeedFormat

Community123.8K runs

bytedance/dreamina-3.1

4MP text-to-image generation with enhanced cinematic-quality image generation with precise style control, improved text ...

SeedW/H

Community122.8K runs

recraft-ai/recraft-20b-svg

Affordable and fast vector images

Community122.8K runs

cjwbw/supir-v0q

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use...

RefsNegSeed

Community120.4K runs

andreasjansson/stable-diffusion-animation

Animate Stable Diffusion by interpolating between two prompts

SeedGuidanceStepsFormatW/H

Community119.6K runs

tencent/hunyuan-video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from tex...

SeedW/HFPS

Community116.9K runs

nightmareai/latent-sr

Upscale images with the latent diffusion superresolution model

RefsSteps

Community116.7K runs

xai/grok-text-to-speech

Convert text to natural-sounding speech with xAI's Grok TTS. 5 voices, 20 languages, expressive speech tags, and high-fi...

FormatVoice

Community115.6K runs

black-forest-labs/flux-1.1-pro-ultra-finetuned

Inference model for FLUX 1.1 [pro] Ultra using custom `finetune_id`. Supports 4MP images and raw mode for realism

RefsSeedFormat

Community110.6K runs

bytedance/latentsync

LatentSync: generate high-quality lip sync animations

SeedGuidance

Community110.3K runs

ibm-granite/granite-20b-code-instruct-8k

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of u...

SeedTemp

Community110.0K runs

bria/increase-resolution

Bria Increase resolution upscales the resolution of any image. It increases resolution using a dedicated upscaling metho...

Refs

Community108.2K runs

runwayml/gen4-image-turbo

Gen-4 Image Turbo is cheaper and 2.5x faster than Gen-4 Image. An image model with references, use up to 3 reference ima...

RefsSeed

Community108.2K runs

lucataco/ace-step

A Step Towards Music Generation Foundation Model text2music

SeedGuidanceSchedulerDuration

Community107.5K runs

minimax/speech-2.8-hd

Minimax Speech 2.8 HD focuses on high-fidelity audio generation with features like studio-grade quality, flexible emotio...

Voice

Community106.9K runs

reve/create

Image generation model from Reve

Seed

Community106.0K runs

google/veo-2

State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly s...

RefsSeedDuration

Community105.9K runs

google/gemini-3.5-flash

Google's fast multimodal model with frontier reasoning across agents, coding, and long-context tasks

Temp

Community105.1K runs

stability-ai/stable-diffusion-3.5-medium

2.5 billion parameter image model with improved MMDiT-X architecture

RefsNegSeedGuidanceFormat

Community104.8K runs

open-mmlab/pia

Personalized Image Animator

RefsNegSeedGuidance

Community103.5K runs

cswry/seesr

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

RefsNegSeedGuidanceStepsScale

Community103.0K runs

runwayml/gen4-turbo

Generate 5s and 10s 720p videos fast

RefsSeedDuration

Community101.6K runs

cjwbw/seamless_communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Community100.2K runs

replicate/llama-7b

Transformers implementation of the LLaMA language model

Temp

Community99.4K runs

flux-kontext-apps/face-to-many-kontext

Become a character, in style

RefsSeedFormatMulti

Community98.6K runs

anthropic/claude-4.5-haiku

Claude Haiku 4.5 gives you similar levels of coding performance but at one-third the cost and more than twice the speed

$0.012/img98.3K runs

reve/edit

Image editing model from Reve

Refs

Community98.1K runs

chigozienri/mediapipe-face

batch or individual face detection with mediapipe

Community96.1K runs

openai/gpt-5.4

OpenAI's most capable frontier model for complex professional work, coding, and multi-step reasoning.

Refs

Community95.9K runs

daanelson/whisperx

Accelerated transcription of audio using WhisperX

Community94.4K runs

cuuupid/glm-4v-9b

GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA ...

Refs

Community93.5K runs

xai/grok-imagine-video-1.5

Image-to-video with synchronized audio using xAI's Grok Imagine Video 1.5 preview model

RefsDuration

Community92.0K runs

aleksa-codes/flux-ghibsky-illustration

Flux LoRA, use 'GHIBSKY style' to trigger generation, creates serene and enchanting landscapes with vibrant, surreal ski...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community91.2K runs

kwaivgi/kling-v2.0

Generate 5s and 10s videos in 720p resolution

NegGuidanceDurationImg2Vid

Community90.4K runs

fictions-ai/autocaption

Automatically add captions to a video

Community90.1K runs

flux-kontext-apps/portrait-series

Create a series of portrait photos from a single image

RefsFormatMulti

Community89.0K runs

openai/sora-2-pro

OpenAI's Most advanced synced-audio video generation

Community88.8K runs

datacte/flux-aesthetic-anime

Flux lora, trained on the unique style and aesthetic of ghibli retro anime

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community88.7K runs

kwaivgi/kling-v2.1-master

A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or...

NegDurationImg2Vid

Community88.5K runs

wan-video/wan-2.7-image-pro

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-ima...

SeedMulti

Community88.5K runs

wavespeedai/wan-2.1-i2v-720p

Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video found...

RefsNegSeedStepsSafetyLoRA

Community86.8K runs

chenxwh/openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Community84.3K runs

inworld/realtime-tts-1.5-mini

Ultra-fast, cost-efficient realtime text-to-speech with ~120ms latency and 15-language support

TempVoice

Community83.6K runs

lucataco/trim-video

Simple tool to quickly trim a video or audio file

FormatDuration

Community81.6K runs

arielreplicate/robust_video_matting

extract foreground of a video

Community81.0K runs

lightricks/ltx-2-fast

Ideal for rapid ideation and mobile workflows. Perfect for creators who need instant feedback, real-time previews, or hi...

RefsDuration

Community80.4K runs

mirelo/video-to-sfx-v1.5

Generate synced sounds for any video and return it with its new soundtrack - now enhanced in version 1.5 for improved so...

SeedStepsDuration

Community78.5K runs

anthropic/claude-opus-4.7

Anthropic's most capable model with a step-change improvement in agentic coding, better vision, and stronger multi-step ...

Refs

Community75.0K runs

minimax/video-01-director

Generate videos with specific camera movements

Img2Vid

Community74.5K runs

m1guelpf/whisper-subtitles

Generate subtitles from an audio file, using OpenAI's Whisper model.

Community73.9K runs

flux-kontext-apps/professional-headshot

Create a professional headshot photo from any single image

RefsSeedFormat

Community72.7K runs

wan-video/wan2.6-i2v-flash

Image-to-video generation with optional audio, multi-shot narrative support, and faster inference

RefsNegSeedDuration

Community72.5K runs

black-forest-labs/flux-redux-schnell

Fast, efficient image variation model for rapid iteration and experimentation.

SeedStepsFormatSafetyMulti

Community71.3K runs

xai/grok-imagine-image-quality

xAI's higher-quality image model with sharper details, better text rendering, and 2k output

Refs

Community71.3K runs

wan-video/wan-2.2-s2v

Generate a video from an audio clip and a reference image

RefsSeed

Community69.0K runs

cjwbw/rmgb

Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-sourc...

Refs

Community67.4K runs

fofr/tooncrafter

Create videos from illustrated input images

NegSeedLoop

Community66.5K runs

meta/llama-guard-4-12b

No description available

RefsTemp

Community63.2K runs

google/lyria-2

Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts

NegSeed

Community62.9K runs

andreasjansson/musicgen-looper

Generate fixed-bpm loops from text prompts

SeedFormatTemp

Community60.7K runs

xai/grok-imagine-r2v

Generate videos guided by reference images using xAI's Grok Imagine Video model

RefsDuration

Community60.3K runs

zsxkib/animate-diff

🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models with...

NegSeedGuidanceStepsFormatW/H

Community59.3K runs

luma/ray-flash-2-540p

Generate 5s and 9s 540p videos, faster and cheaper than Ray 2

DurationImg2VidLoop

Community58.4K runs

prunaai/p-video-avatar

p-video-avatar is the fastest and cheapest avatar/lipsync video model on the market.

RefsNegSeedVoice

Community57.8K runs

meta/sam-2-video

SAM 2: Segment Anything v2 (for videos)

Format

Community57.1K runs

sdsgitaccount/flux-gmoveus

Flux lora, use "GMOVEUS" to trigger movement MEME

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community56.9K runs

joehoover/mplug-owl

An instruction-tuned multimodal large language model that generates text based on user-provided prompts and images

Community55.8K runs

minimax/hailuo-2.3

A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and s...

DurationImg2Vid

Community55.2K runs

zsxkib/film-frame-interpolation-for-large-motion

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Community54.3K runs

minimax/music-1.5

Music-1.5: Full-length songs (up to 4 mins) with natural vocals & rich instrumentation

Community54.2K runs

minimax/hailuo-02-fast

A low cost and fast version of Hailuo 02. Generate 6s and 10s videos in 512p

DurationImg2Vid

Community53.6K runs

wan-video/wan-2.7-image

Generate and edit images with Alibaba's Wan 2.7

SeedMulti

Community52.9K runs

pixverse/pixverse-v6

PixVerse's flagship video generation model. Generate cinematic videos with synchronized audio, multi-shot sequences, and...

RefsNegSeedDurationImg2Vid

Community52.8K runs

black-forest-labs/flux-2-klein-4b-base

Un-distilled version of FLUX.2 [klein]. Optimized for fine-tuning, customization, and post-training workflows

SeedGuidanceFormatSafety

Community52.6K runs

wan-video/wan-2.5-i2v-fast

Wan 2.5 image-to-video, optimized for speed

RefsNegSeedDuration

Community52.6K runs

tencent/hunyuan-image-3

A powerful native multimodal model for image generation (PrunaAI squeezed)

SeedFormatSafety

Community52.5K runs

lucataco/deepseek-ocr

Convert documents to markdown, extract raw text, and locate specific content

Refs

Community52.4K runs

fermatresearch/magic-style-transfer

Restyle an image with the style of another one. I strongly suggest to upscale the results with Clarity AI

RefsNegSeedGuidanceStepsSchedulerMultiLoRA

Community51.9K runs

bria/generate-background

Bria Background Generation allows for efficient swapping of backgrounds in images via text prompts or reference image, d...

RefsNegSeed

Community50.5K runs

juergengunz/ultimate-portrait-upscale

Upscale Portrait Images with ControlNet Tile

RefsNegSeedGuidanceStepsScheduler

Community50.3K runs

reve/edit-fast

Reve's fast image edit model at only $0.01 per edit

Refs

Community49.1K runs

zsxkib/ic-light-background

🖼️✨Background images + prompts to auto-magically relights your images (+normal maps🗺️)

NegSeedGuidanceStepsFormatW/H

Community48.6K runs

datalab-to/marker

Convert PDF to markdown + JSON quickly with high accuracy

Community48.5K runs

prunaai/hidream-l1-dev

This is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!

SeedFormat

Community48.4K runs

sync/lipsync-2-pro

Studio-grade lipsync in minutes, not weeks

Temp

Community48.1K runs

wan-video/wan-2.2-i2v-a14b

Image-to-video at 720p and 480p with Wan 2.2 A14B

RefsSeedStepsFPS

Community47.9K runs

black-forest-labs/flux-2-klein-9b-base

Un-distilled version of FLUX.2 [klein]. A foundation model for maximum flexibility and control

SeedGuidanceFormatSafety

Community47.9K runs

minimax/hailuo-2.3-fast

A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and styliza...

DurationImg2Vid

Community47.6K runs

wan-video/wan-2.1-1.3b

Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Grou...

SeedSteps

Community47.3K runs

wan-video/wan-2.7-i2v

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization...

NegSeedDuration

Community47.1K runs

wan-video/wan-2.2-animate-replace

Use Wan 2.2 Animate to replace a character in a video scene

SeedFPS

Community45.5K runs

grandlineai/instant-id-photorealistic

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Juggernaut-XL v8 as the base model to encourage p...

RefsNegGuidanceStepsW/H

Community45.4K runs

wan-video/wan-2.6-i2v

Alibaba Wan 2.6 image to video generation model

RefsNegSeedDuration

Community45.3K runs

zsxkib/seedvr2

🔥 SeedVR2: one-step video & image restoration with 3B/7B hot‑swap and optional color fix 🎬✨

SeedGuidanceStepsFormatFPS

Community45.1K runs

minimax/voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

Community44.7K runs

tencent/hunyuan-3d-3.1

3D models with texture fidelity and geometry precision

Refs

Community44.5K runs

luma/ray-flash-2-720p

Generate 5s and 9s 720p videos, faster and cheaper than Ray 2

DurationImg2VidLoop

Community44.5K runs

wan-video/wan-2.5-t2v-fast

Wan 2.5 text-to-video, optimized for speed

NegSeedDuration

Community44.0K runs

cjwbw/night-enhancement

Unsupervised Night Image Enhancement

Refs

Community44.0K runs

bingbangboom-lab/flux-dreamscape

Flux lora, use "BSstyle004" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community43.6K runs

lucataco/pasd-magnify

(Academic and Non-commercial use only) Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalize...

RefsNegSeedGuidance

Community43.4K runs

davisbrown/flux-half-illustration

Flux lora, use "in the style of TOK" to trigger generation, creates half photo half illustrated elements

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community42.8K runs

cjwbw/text2video-zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

NegSeedFPS

Community42.0K runs

reve/remix

Image generation model from Reve which handles multiple input reference images

Refs

Community41.4K runs

google/veo-3.1-lite

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

RefsSeedDuration

Community41.4K runs

wan-video/wan2.1-with-lora

Run Wan2.1 14b or 1.3b with a lora

RefsNegSeedSteps

Community41.3K runs

pixverse/pixverse-v4

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

RefsNegSeedDurationImg2Vid

Community41.1K runs

veed/fabric-1.0

VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video

Refs

Community41.1K runs

bytedance/omni-human-1.5

A film-grade digital human model that generates realistic video from a single image, audio clip, and optional text promp...

RefsSeed

Community41.0K runs

x-lance/f5-tts

F5-TTS, the new state-of-the-art in open source voice cloning

Community40.5K runs

sync/lipsync-2

Generate realistic lipsyncs with Sync Labs' 2.0 model

Temp

Community40.3K runs

perceptron-ai-inc/isaac-0.1

an open-source, 2B-parameter model built for real-world applications

Refs

Community39.6K runs

qwen/qwen-image-2

A next-generation image generation and editing model from Alibaba's Qwen team. Supports text-to-image and image editing ...

RefsNegSeed

Community39.4K runs

leonardoai/phoenix-1.0

Leonardo AI’s first foundational model produces images up to 5 megapixels (fast, quality and ultra modes)

Multi

Community39.1K runs

lucataco/ip_adapter-sdxl-face

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDXL images with a...

RefsNegSeedStepsMultiScale

Community38.8K runs

arielreplicate/stable_diffusion_infinite_zoom

Use Runway's Stable-diffusion inpainting model to create an infinite loop video

Format

Community38.5K runs

lightricks/ltx-2.3-pro

High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven ...

RefsDurationImg2VidFPS

Community38.4K runs

easel/ai-avatars

Use one or two face images to create AI avatars

Community37.2K runs

luma/ray-2-720p

Generate 5s and 9s 720p videos

DurationImg2VidLoop

Community36.8K runs

qwen/qwen-image-2-pro

The pro version of Qwen Image 2 from Alibaba's Qwen team. Enhanced text rendering, realism, and semantic adherence for h...

RefsNegSeed

Community36.8K runs

luma/reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

Community36.7K runs

openai/gpt-4o-transcribe

A speech-to-text model that uses GPT-4o to transcribe audio

Temp

Community35.8K runs

wavespeedai/wan-2.1-t2v-720p

Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video founda...

NegSeedStepsSafetyLoRA

Community35.5K runs

pbarker/gfpgan-video

GFPGAN for human face video upscaling

Scale

Community34.8K runs

retro-diffusion/rd-fast

Fast pixel art image generation

RefsSeedW/HMulti

Community34.2K runs

prunaai/hidream-l1-full

This is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!

SeedFormat

Community33.9K runs

lucataco/orpheus-3b-0.1-ft

Orpheus 3B - high quality, emotive Text to Speech

TempVoice

Community33.6K runs

ismail-seleit/formfinder-flux

Flux version of FormFinder-XL - trained to create moody atmospheric images but is quite versatile to be mixed with other...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community33.6K runs

kwaivgi/kling-lip-sync

Add lip-sync to any video with an audio file or text

Voice

Community33.3K runs

lucataco/videollama3-7b

VideoLLaMA 3: Frontier Multimodal Foundation Models for Video Understanding

FPSTemp

Community32.7K runs

apolinario/flux-tarot-v1

Flux lora, use "in the style of TOK a trtcrd tarot style" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community32.6K runs

chenxwh/video-retalking

Audio-based Lip Synchronization for Talking Head Video

Community32.4K runs

wan-video/wan-2.5-t2v

Alibaba Wan 2.5 text to video generation model

NegSeedDuration

Community32.4K runs

lucataco/ip-adapter-faceid

(Research only) IP-Adapter-FaceID can generate various style images conditioned on a face with only text prompts

NegSeedStepsW/HMulti

Community32.2K runs

halimalrasihi/flux-red-cinema

Cinematic Flux LoRA: Use "r3dcma" in your prompt to trigger this LoRA model.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community31.8K runs

lucataco/qwen2.5-omni-7b

Qwen2.5-Omni is an end-to-end multimodal model designed to perceive diverse modalities, including text, images, audio, a...

Refs

Community31.7K runs

fofr/face-swap-with-ideogram

Use ideogram-character to face-swap someone into a target image

Community29.6K runs

meronym/speaker-transcription

Whisper transcription plus speaker diarization

Community28.3K runs

retro-diffusion/rd-plus

High quality and authentic pixel art image generation

RefsSeedW/HMulti

Community28.0K runs

playht/play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware...

SeedTempVoice

Community27.1K runs

awerks/whisperx

Fast automatic speech recognition (70x realtime with large-v2) with word-level timestamps and speaker diarization.

Community25.8K runs

sabuhigr/sabuhi-model

Whisper AI with channel separation and speaker diarization

Temp

Community25.5K runs

camenduru/tripo-sr

TripoSR: Fast 3D Object Reconstruction from a Single Image

Community25.5K runs

levelsio/disposable-camera

Take photos with a disposable camera. Like this? Use this with yourself in it on my app PhotoAI.com

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community25.4K runs

datalab-to/ocr

Detect and transcribe text in images with accurate bounding boxes, layout analysis, reding order, and table recognition,...

Community25.3K runs

wan-video/wan-2.2-animate-animation

Use Wan 2.2 Animate to copy the motion of a video to another scene

SeedFPS

Community25.1K runs

lightricks/ltx-2.3-fast

Lightning-fast video generation with portrait support, camera controls, and synchronized audio. Up to 20 seconds at 1080...

RefsDurationImg2VidFPS

Community24.8K runs

lucataco/video-merge

Simple tool to merge together separate video snippets

W/HFPS

Community24.7K runs

nvidia/parakeet-rnnt-1.1b

🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

Community24.5K runs

resemble-ai/chatterbox-multilingual

Generate expressive, natural speech in 23 languages. Features instant voice cloning from short audio, emotion control, a...

SeedTemp

Community24.1K runs

lightricks/ltx-2-distilled

LTX-2: The first open source audio-video model

RefsSeed

Community23.8K runs

lucataco/video-audio-merge

merge a video and an audio file

Format

Community22.9K runs

stability-ai/stable-audio-2.5

Generate high-quality music and sound from text prompts

SeedGuidanceStepsDuration

Community22.8K runs

lightricks/ltx-2-pro

Delivers high visual fidelity with fast turnaround. Great for daily content creation, marketing teams, and iterative cre...

RefsDuration

Community22.7K runs

zsxkib/create-rvc-dataset

Create your own Realistic Voice Cloning (RVC v2) dataset using a YouTube link

Community21.5K runs

fanyiy/flux-notion-illustration

Notion-style illustration

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community21.1K runs

tencent/hunyuan-image-2.1

Generate high-quality 2K resolution images from text prompts

SeedFormatSafety

Community20.6K runs

cjwbw/supir-v0f

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use...

RefsNegSeed

Community20.5K runs

ibm-granite/granite-speech-3.3-8b

Granite-speech-3.3-8b is a compact and efficient speech-language model, specifically designed for automatic speech recog...

SeedTemp

Community20.5K runs

miike-ai/flux-ico

Create beautiful icons & emojis

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community20.0K runs

igorriti/flux-360

Generate 360 panorama images.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community19.9K runs

fofr/kontext-make-person-real

A FLUX Kontext fine-tune to fix plastic AI skin textures

RefsSeedGuidanceStepsFormatSafety

Community19.8K runs

zsxkib/aura-sr-v2

AuraSR v2: Second-gen GAN-based Super-Resolution for real-world applications

RefsFormat

Community19.7K runs

adidoes/whisperx-video-transcribe

ASR from video URL based on whisperx using large-v2 model

Community19.6K runs

kwaivgi/kling-avatar-v2

Create avatar videos with realistic humans, animals, cartoons, or stylized characters

Refs

Community19.2K runs

sourceful/riverflow-2.0-pro

Agentic image model optimized for robust, high-precision generations supporting font control

Format

Community19.1K runs

sakemin/musicgen-remixer

Remix the music into another styles with MusicGen Chord

SeedFormatTemp

Community18.8K runs

deepfates/deepfits_flux_dev

A fashion model

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community18.7K runs

codeplugtech/object_remover

No description available

Community18.3K runs

resemble-ai/chatterbox-pro

Generate expressive, natural speech with Resemble AI's Chatterbox.

SeedTempVoice

Community18.1K runs

google/lyria-3-pro

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music gen...

Seed

Community17.4K runs

prunaai/vace-14b

This is a faster VACE-14B model, optimised with pruna, contact us for more at pruna.ai

SeedSteps

Community16.5K runs

recraft-ai/recraft-creative-upscale

Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn’t just increase resol...

Refs

Community16.5K runs

moonshotai/kimi-k2.5

Moonshot AI's latest open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi...

RefsTemp

Community16.5K runs

davisbrown/designer-architecture

Create professional architecture and interior designs

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community16.5K runs

openai/o1

OpenAI's first o-series reasoning model

Refs

Community16.4K runs

flux-kontext-apps/iconic-locations

Put yourself in an iconic location around the world from a single image

RefsSeedFormat

Community16.1K runs

zsxkib/step1x-edit

✍️Step1X-Edit by stepfun-ai, Edit an image using text prompt📸

RefsSeedFormat

Community15.8K runs

cjwbw/shap-e

Generating Conditional 3D Implicit Functions

RefsGuidance

Community15.8K runs

levelsio/analog-film

Take photos in analog film style

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community15.5K runs

alibaba/happyhorse-1.0

Alibaba's Happy Horse 1.0 generates videos from text prompts or animates a single image into video. Supports 720p and 10...

RefsSeedDuration

Community15.3K runs

moonshotai/kimi-k2.6

Moonshot AI's frontier open model, built for long-horizon coding, agent swarms, and autonomous software engineering. 1 t...

RefsTemp

Community15.3K runs

ibm-granite/granite-4.1-8b

Granite-4.1-8B is a 8B parameter long-context instruct model finetuned from Granite-4.1-8B-Base using a combination of o...

SeedTemp

Community15.2K runs

fofr/video-morpher

Generate a video that morphs between subjects, with an optional style

NegSeed

Community15.2K runs

raulduke9119/flux_realism

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community14.9K runs

cjwbw/aniportrait-audio2vid

Audio-Driven Synthesis of Photorealistic Portrait Animations

RefsSeedGuidanceStepsW/HFPS

Community14.9K runs

mv-lab/instructir

High-Quality Image Restoration Following Human Instructions

RefsSeed

Community14.8K runs

tencent/hunyuan3d-2mv

Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.

SeedGuidanceSteps

Community14.6K runs

lucataco/fuyu-8b

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

Refs

Community14.6K runs

lucataco/modelscope-facefusion

Auto fuse a user's face onto the template image, with a similar appearance to the user

Community14.6K runs

aramintak/flux-softserve-anime

Flux lora, use "sftsrv style illustration" to trigger the image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community14.5K runs

qwen-edit-apps/qwen-image-edit-plus-lora-skin

Skin – Natural beauty retouch that enhances pores and tonal variation (no plastic skin) via the Skin LoRA.

RefsSeedStepsFormatSafetyLoRA

Community14.5K runs

character-ai/ovi-i2v

Ovi: generate videos with audio from image and text inputs

RefsSeed

Community14.4K runs

wan-video/wan-2.6-t2v

Alibaba Wan 2.6 text to video generation model

NegSeedDuration

Community14.4K runs

google/nano-banana-2-lite

Google's fastest image generation model — the lightweight, low-cost version of Nano Banana 2, for rapid creation and edi...

RefsFormat

Community14.4K runs

tudortotolici/newspaper_illustration

The "newspaper illustration" model specializes in creating black-and-white, cartoon-style drawings reminiscent of classi...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community14.0K runs

bytedance/seedream-5-pro

ByteDance's flagship text-to-image and image editing model, generating sharp 1K and 2K images from text or up to 10 refe...

RefsFormat

Community13.7K runs

bria/genfill

Bria GenFill enables high-quality object addition or visual transformation. Trained exclusively on licensed data for saf...

RefsNegSeedMask

Community13.6K runs

fofr/kontext-old-and-damaged

Use this kontext fine-tune to turn any photo into an old and damaged photo

RefsSeedGuidanceStepsFormatSafety

Community13.5K runs

camenduru/metavoice

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

Community13.5K runs

zsxkib/dia

Dia 1.6B by Nari Labs, Generates realistic dialogue audio from text, including non-verbal cues and voice cloning

SeedGuidanceTemp

Community12.8K runs

lucataco/speaker-diarization

Segments an audio recording based on who is speaking (on A100)

Community12.8K runs

grandlineai/instant-id-artistic

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Dreamshaper-XL as the base model to encourage art...

RefsNegGuidanceStepsW/H

Community12.3K runs

philz1337x/clarity-pro-upscaler

The first creative upscaler which keeps identity. Stunning photorealistic results, realistic skin, and full creative con...

RefsFormatScale

Community12.2K runs

arielreplicate/deoldify_video

Add colours to old video footage.

Community11.9K runs

wavespeedai/qwen-image

A 20B MMDiT model for next-gen text-to-image generation

Community11.8K runs

xai/grok-imagine-image

SOTA image model from xAI

Refs

Community11.6K runs

pellmellism/xkcd

epic xkcd comics

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community11.6K runs

bria/fibo

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality A...

RefsNegSeedGuidance

Community11.4K runs

jd7h/zero123plusplus

Turn an image into a set of images from different 3D angles

Refs

Community11.4K runs

anthropic/claude-opus-4.6

Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities

Refs

$0.007/img11.4K runs

openai/gpt-4o-mini-transcribe

A speech-to-text model that uses GPT-4o mini to transcribe audio

Temp

Community11.4K runs

nohamoamary/image-captioning-with-visual-attention

datasets: Flickr8k

Refs

Community11.3K runs

makinsongary698/jh

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community11.3K runs

lucataco/demofusion-enhance

Image to Image enhancer using DemoFusion

RefsNegSeedGuidanceStepsScale

Community11.2K runs

wan-video/wan-2.7-videoedit

Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model

RefsSeedDuration

Community11.1K runs

luma/ray-2-540p

Generate 5s and 9s 540p videos

DurationImg2VidLoop

Community11.0K runs

lucataco/rembg-video

Video Background Removal

Community10.8K runs

cjwbw/voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

SeedTemp

Community10.8K runs

black-forest-labs/flux-pro-finetuned

Inference model for FLUX.1 [pro] using custom `finetune_id`

RefsSeedGuidanceStepsFormatW/H

Community10.8K runs

tencentarc/animesr

Real-World Super-Resolution Models for Animation Videos

Community10.6K runs

leonardoai/motion-2.0

Create 5s 480p videos from a text prompt

RefsNeg

Community10.6K runs

zsxkib/animatediff-illusions

Monster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)

NegSeedGuidanceStepsSchedulerFormatW/HLoop

Community10.5K runs

minimax/music-2.6

Generate full-length songs or instrumentals from a text prompt, with optional auto-generated lyrics

Community10.5K runs

cuuupid/flux-lineart

Flux finetuned for black and white line art.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community10.3K runs

retro-diffusion/rd-animation

Style consistent animated pixel art sprite generation

RefsSeedW/H

Community10.2K runs

adirik/dreamgaussian

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

Community10.1K runs

runwayml/gen-4.5

State-of-the-art video motion quality, prompt adherence and visual fidelity

RefsSeedDuration

Community10.0K runs

retro-diffusion/rd-tile

All the tools you need for generating pixel art tilesets

RefsSeedW/HMulti

Community9.9K runs

zylim0702/remove_bg

Best Human detection and Object Detection Background removal.

Refs

Community9.7K runs

charlesmccarthy/animagine-xl

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime ...

RefsNegSeedGuidanceStepsSchedulerW/HSafetyMultiMask

Community9.4K runs

pollinations/real-basicvsr-video-superresolution

RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution

Community9.3K runs

lightricks/ltx-video-0.9.7-distilled

Faster slight quality reduction compared to LTX-Video 13b

RefsNegSeedGuidanceStepsFPS

Community9.3K runs

zsxkib/pyramid-flow

Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching

RefsGuidanceDurationFPS

Community9.2K runs

ibm-granite/granite-3.1-2b-instruct

Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following t...

Temp

Community9.2K runs

zsxkib/flux-music

🎼FluxMusic Text-to-Music Generation with Rectified Flow Transformer🎶

NegSeedGuidanceSteps

Community8.9K runs

heygen/video-translate

Translate videos into over 150 languages

Community8.9K runs

minimax/music-2.5

Generate full-length songs with vocals, lyrics, and rich instrumentation from a text prompt

Community8.9K runs

lucataco/kontext-realearth

This Kontext LoRA turns basic satellite images into quality drone shots

RefsSeedGuidanceStepsFormatSafety

Community8.8K runs

cjwbw/face-align-cog

face alignment using stylegan-encoding

Refs

Community8.7K runs

flux-kontext-apps/filters

Add simple filters to your images

RefsSeedFormat

Community8.6K runs

levelsio/neon-tokyo

Take photos in the style of rainy Tokyo nights with neon lights

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community8.6K runs

elevenlabs/v3

The most expressive Text to Speech model

Voice

Community8.5K runs

prunaai/p-image-try-on

Virtual try-on. Put one or more garments onto a person photo while keeping their face, pose, and body.

SeedFormat

Community8.3K runs

zsxkib/thinksound

Generate contextual audio from video using step-by-step reasoning🎶

SeedGuidanceSteps

Community8.3K runs

lucataco/smolvlm-instruct

SmolVLM-Instruct by HuggingFaceTB

Refs

Community8.3K runs

elevenlabs/scribe-v2

Transcribe speech with ElevenLabs Scribe v2. 90+ languages, word-level timestamps, speaker diarization for up to 32 spea...

SeedTemp

Community8.1K runs

prunaai/hunyuan3d-2

hunyuan3d-2 optimised with the pruna toolkit: https://github.com/PrunaAI/pruna

Steps

Community8.0K runs

lucataco/stable-diffusion-x4-upscaler

Stable Diffusion x4 upscaler model

RefsScale

Community8.0K runs

luma/modify-video

Modify a video with style transfer and prompt-based editing

Community8.0K runs

afterpeak/flux-slowed

Flux LORA to generate images in the style of the arworks used for sowed versions of a song

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community7.3K runs

lucataco/flux-watercolor

A Flux LoRA trained on watercolor style photos

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community7.3K runs

tmappdev/lipsync

Lipsync model using MuseTalk

FPS

Community7.3K runs

halimalrasihi/flux-mystic-animals

Flux LoRA: Use "m1st1c" in your prompt to trigger this LoRA model.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community7.2K runs

cudanexus/ocr-surya

Surya is a document OCR toolkit that does:

Refs

Community7.0K runs

justmalhar/flux-thumbnails

Generate 16:9 Thumbnails. Use prefix - `Thumbnail in the style of TOK`

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community6.9K runs

lucataco/llama-3-vision-alpha

Projection module trained to add vision capabilties to Llama 3 using SigLIP

Refs

Community6.8K runs

fofr/flux-minecraft-movie

Flux lora, use "MNCRFTMOV" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community6.7K runs

pixverse/pixverse-v5.6

Latest video model from Pixverse with astonishing physics

RefsNegSeedDurationImg2Vid

Community6.6K runs

flux-kontext-apps/impossible-scenarios

Experience impossible adventures and extreme scenarios from a single image

RefsSeedFormat

Community6.3K runs

recraft-ai/recraft-v4.1

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, an...

Community6.3K runs

sourceful/riverflow-2.0-fast

Agentic image model optimized for high-quality, fast generations supporting font control

Format

Community6.3K runs

sebastianbodza/flux_lora_retro_linedrawing_style_v1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community6.3K runs

justmalhar/flux-sketchnotes

Generates hand-drawn sketchnotes with great detail. Prompt Prefix: “A sketchnote in the style of TOK”

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community6.1K runs

ibm-granite/granite-vision-4.1-4b

Granite Vision 4.1 4B is a vision-language model (VLM) that delivers frontier-level performance on structured document e...

SeedTemp

Community6.1K runs

0xtuba/archillect-lora

Generates images in the style of Archillect

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community6.1K runs

ideogram-ai/layerize

Take a flat graphic, remove text, and get structured text layers back for editing and recomposing

Seed

Community6.0K runs

aramintak/flux-koda

Flux lora, use "flmft style" to trigger the image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community6.0K runs

zsxkib/bsrgan

Upscale videos + images with BSRGAN

Scale

Community5.9K runs

zsyoaoa/invsr

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Seed

Community5.9K runs

inworld/realtime-tts-2

Most expressive text-to-speech model from Inworld, with natural-language steering, real-time latency, and multilingual s...

TempVoice

Community5.7K runs

zsxkib/animatediff-prompt-travel

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narrative...

NegSeedGuidanceStepsSchedulerFormatW/H

Community5.7K runs

collectiveai-team/speaker-diarization-3

Segments an audio recording based on who is speaking

Community5.7K runs

fofr/flux-80s-cyberpunk

A flux lora trained on a 1980s cyberpunk aesthetic

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community5.6K runs

black-forest-labs/flux-2-klein-9b-base-lora

A version of FLUX.2 [klein] 9B-base that supports fast fine-tuned lora inference

SeedFormatSafetyLoRA

Community5.6K runs

recraft-ai/recraft-v4

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, an...

Community5.5K runs

mirelo/video-to-sfx-v1

Generate synced sounds for any video, and return it with its new sound track

SeedStepsDuration

Community5.4K runs

flux-kontext-apps/kontext-emoji-maker

Use kontext to turn any image into an emoji, using a lora by starsfriday

RefsFormat

Community5.4K runs

xai/grok-imagine-video-extension

Extend videos with xAI's Grok Imagine Video model. Provide a source video and describe what happens next.

Duration

Community5.3K runs

jakedahn/flux-latentpop

flux-latentpop features vibrant backgrounds with grungy limited screenprinting color goodness.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community5.1K runs

qwen-edit-apps/qwen-image-edit-plus-lora-next-scene

Next Scene – “Next beat” cinematic edits that keep subject identity while steering to the next camera move via the Next ...

RefsSeedStepsFormatSafetyLoRA

Community5.1K runs

wavespeedai/hunyuan-video-fast

Accelerated inference for HunyuanVideo with high resolution (1280x720), a state-of-the-art text-to-video generation mode...

Community5.1K runs

zsxkib/flash-face

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

NegSeedStepsFormat

Community5.0K runs

awilliamson10/meta-nougat

Nougat: Neural Optical Understanding for Academic Documents

Community4.9K runs

elevenlabs/turbo-v2.5

High quality, low latency text to speech in 32 languages

Voice

Community4.9K runs

moonshotai/kimi-k2-thinking

Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.

Temp

Community4.8K runs

pwntus/flux-albert-einstein

A fine-tuned FLUX.1 model. Use trigger word "EINSTEIN". Created with ReFlux (https://reflux.replicate.dev).

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community4.8K runs

openai/gpt-5-pro

The smartest, fastest, most useful model yet, with built-in thinking that puts expert-level intelligence in everyone’s h...

Refs

Community4.8K runs

elevenlabs/music

Compose a song from a prompt or a composition plan

Format

Community4.7K runs

cjwbw/docentr

End-to-End Document Image Enhancement Transformer

Refs

Community4.7K runs

ibm-granite/granite-embedding-278m-multilingual

Granite-Embedding-278M-Multilingual is a 278M parameter model from the Granite Embeddings suite that can be used to gene...

Community4.5K runs

codingdudecom/flux-kontext-stencil-lora

Stencil maker - create a black and white stencil image from any photo

RefsSeedGuidanceStepsFormatSafety

Community4.5K runs

lucataco/extract-audio

Simple tool to extract audio from a video file

Format

Community4.4K runs

lucataco/ip_adapter-face-inpaint

A combination of ip_adapter SDv1.5 and mediapipe-face to inpaint a face

SeedMulti

Community4.3K runs

lucataco/controlnet-tile

Controlnet v1.1 - Tile Version

RefsSeedStepsScale

Community4.2K runs

camenduru/lgm

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

RefsNegSeed

Community4.2K runs

fofr/flux-mjv3

Flux lora trained on Midjourney v3 outputs from 2022, use "a dream, in the style of MJV3" to trigger generation, also tr...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community4.1K runs

lightricks/ltx-2-retake

Take any shot and edit specific sections. Rephrase, change the action, camera angles and more

Duration

Community4.0K runs

veryvanya/flux-ps1-style

Flux lora, use "ps1 game screenshot" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community4.0K runs

lucataco/ollama-llama3.2-vision-11b

Ollama Llama 3.2 Vision 11B

RefsTemp

Community4.0K runs

bria/product-shadow

Add consistent, customizable shadows to product cutouts for enhanced visual appeal

Refs

Community3.9K runs

kwaivgi/kling-v1.5-pro

Generate 5s and 10s videos in 1080p resolution at 30fps

NegGuidanceDurationImg2Vid

Community3.9K runs

lucataco/ollama-llama3.2-vision-90b

Ollama Llama 3.2 Vision 90B

RefsTemp

Community3.8K runs

prunaai/sdxl-lightning

This is the fastest sdxl-lightning endpoint in the world on A100, contact us for more at pruna.ai

SeedGuidanceStepsFormatMulti

Community3.8K runs

jbilcke/flux-dev-panorama-lora

A flux lora for panoramas, use 21:9 and "HDRI panoramic view of TOK" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.8K runs

zeke/ziki-flux

A Flux fine-tune of https://replicate.com/zeke the real-life human. Use "ZIKI" in the prompt to activate the trained sty...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.7K runs

fofr/flux-color

Flux lora, use "CLR" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.6K runs

fofr/flux-mona-lisa

Flux lora, use the term "MNALSA" to trigger generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.6K runs

qwen-edit-apps/qwen-image-edit-plus-lora-relight

Relight – Soft, curtain-filtered relighting that repaints the scene with golden-hour or moody tones using the Relight Lo...

RefsSeedStepsFormatSafetyLoRA

Community3.6K runs

sebastianbodza/flux_aquarell_watercolor_style

A watercolor Aquarell style lora for flux

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.5K runs

sakemin/musicgen-stereo-chord

Generate music in stereo, restricted to chord sequences and tempo

SeedFormatDurationTemp

Community3.4K runs

zsxkib/aura-sr

AuraSR: GAN-based Super-Resolution for real-world

RefsScale

Community3.3K runs

openai/o1-mini

A small model alternative to o1

Community3.3K runs

pixverse/pixverse-v3.5

Create videos in as little as 10 seconds. 5s or 8s videos at 360p, 540p, 720p or 1080p.

RefsNegSeedDurationImg2Vid

Community3.3K runs

qwen-edit-apps/qwen-image-edit-plus-lora-photo-to-anime

Photo to Anime – Stylized conversion that turns photos into crisp cel-shaded anime frames using the Photo-to-Anime LoRA.

RefsSeedStepsFormatSafetyLoRA

Community3.3K runs

zsxkib/multitalk

Audio-driven multi-person conversational video generation - Upload audio files and a reference image to create realistic...

RefsSeed

Community3.3K runs

hyper3d/rodin

Generate complex 3D models from images with Rodin Gen-2

Seed

Community3.2K runs

fofr/0_1-webp

Make pictures of an AI character named 0_1.webp

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.2K runs

levelsio/lomography

Take photos in the style of a Lomography camera

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community3.2K runs

genmoai/mochi-1

Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary e...

SeedGuidanceStepsFPS

Community3.2K runs

prunaai/p-video-animate

p-video-animate animates a reference image with the motion and audio of a source video. Optimized for speed and cost — 5...

RefsSeedSafety

Community3.2K runs

adirik/wonder3d

Generates 3D assets from images

Refs

Community3.1K runs

wan-video/wan-2.7-t2v

Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchron...

NegSeedDuration

Community3.1K runs

sakemin/musicgen-chord

Generate music restricted to chord sequences and tempo

SeedFormatDurationTemp

Community3.1K runs

cuuupid/marker

Convert scanned or electronic documents to markdown, very very very fast

Community3.1K runs

zsxkib/hunyuan-video2video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from tex...

SeedGuidanceStepsW/H

Community3.0K runs

lucataco/magnet

MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer

Temp

Community2.9K runs

anthropic/claude-sonnet-4.6

Claude Sonnet 4.6 from Anthropic: a full upgrade to coding, computer use, long-context reasoning, agent planning, knowle...

Refs

Community2.9K runs

flux-kontext-apps/depth-of-field

Bring your subjects into focus with FLUX.1 Kontext [pro]

RefsSeedFormat

Community2.8K runs

cjwbw/parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

Community2.8K runs

vidu/q3-pro

High-fidelity video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 108...

SeedDurationImg2Vid

Community2.7K runs

zsxkib/idefics3

Idefics3-8B-Llama3, Answers questions and caption about images

RefsTemp

Community2.6K runs

cuuupid/cogvideox-5b

Generate high quality videos from a prompt

SeedGuidanceStepsMulti

Community2.6K runs

lightricks/ltx-video-0.9.7

DiT-based 13b video generation model, creating 30fps video

RefsNegSeedGuidanceStepsW/HFPS

Community2.6K runs

brunnolou/flux-texture-abstract-painting

Turn anything into an abstract fine art masterpiece 🎨

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community2.5K runs

openai/dall-e-2

The original classic DALLᐧE 2

Community2.5K runs

wan-video/wan-2.7-r2v

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-vi...

RefsNegSeedDuration

Community2.5K runs

aramintak/mooniverse

Trigger phrase: surreal style

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community2.5K runs

prunaai/p-image-lora

Use trained LoRAs from the https://replicate.com/prunaai/p-image-trainer. Find or contribute LoRAs here https://huggingf...

SeedW/HSafetyLoRA

Community2.4K runs

cjwbw/controlvideo

Training-free Controllable Text-to-Video Generation

SeedGuidanceSteps

Community2.4K runs

markredito/90sbadtrip

A LoRA for Flux.1 Dev to re-create really bad and trippy CGI from the 90s.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community2.4K runs

zsxkib/uform-gen

🖼️ Super fast 1.5B Image Captioning/VQA Multimodal LLM (Image-to-Text) 🖋️

Refs

Community2.3K runs

fofr/not-real

Make a very realistic looking real-world AI video

Community2.3K runs

willywongi/donut

Extract structured data from receipt images using Donut 🍩 (Document Understanding Transformer)

Refs

Community2.3K runs

xai/grok-speech-to-text

Transcribe audio to text with xAI's Grok. Handles 25 languages, word-level timestamps, speaker diarization, multichannel...

Community2.3K runs

bytedance/dreamactor-m2.0

Animate any character, humans, cartoons, animals, even non-humans, from a single image + driving video

Refs

Community2.3K runs

lucataco/omnigen2

OmniGen2: a powerful and efficient unified multimodal model

RefsNegSeedStepsSchedulerW/H

Community2.2K runs

felixyifeiwang/eom-phase1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community2.2K runs

fofr/flux-2004

A flux dev lora fine-tuned on bad 2004 digital photography

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community2.2K runs

qwen-edit-apps/qwen-image-edit-plus-lora-upscale

Upscale – Detail-loving upscale/restore pass that sharpens textures and color fidelity with the Upscale LoRA.

RefsSeedStepsFormatSafetyLoRA

Community2.2K runs

recraft-ai/recraft-v4-pro

Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4, with higher ...

Community2.1K runs

fofr/kontext-ps1

FLUX Kontext fine-tune that let's you restyle any image as a PS1 or PS2 video game

RefsSeedGuidanceStepsFormatSafety

Community2.1K runs

darionaviar/cinthia

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community2.1K runs

kwaivgi/kling-o1

Modify an existing video through natural-language commands, changing subjects, environments, and visual style while pres...

RefsDurationImg2Vid

Community2.1K runs

bytedance/video-upscaler

Upscale and enhance video up to 4K at 60fps, with scene-aware presets for AI-generated content, short dramas, UGC, and f...

Community2.0K runs

lucataco/flux-vlta

A Flux finetune of an AI character named: Violeta

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.9K runs

prunaai/p-video-replace

p-video-replace swaps the person in a video with one from a reference image, keeping motion, timing, camera, and scene e...

SeedSafety

Community1.9K runs

lightricks/audio-to-video

Use audio input with an image or prompt to generate videos

RefsGuidance

Community1.9K runs

anthropic/claude-fable-5

Claude Fable 5 from Anthropic: the next generation of intelligence for the hardest knowledge work and coding problems.

Refs

Community1.9K runs

replit/replit-code-v1-3b

Generate code with Replit's replit-code-v1-3b large language model

Community1.9K runs

bria/video-remove-background

Automatically remove backgrounds from videos -perfect for creating clean, professional content without a green screen.

Community1.8K runs

camenduru/one-shot-talking-face

one-shot-talking-face-replicate

Community1.8K runs

karlvann/karl

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.8K runs

shapestudio/floating-flux

Retro style Flux Lora

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.8K runs

qwen-edit-apps/qwen-image-edit-plus-lora-fusion

Fusion – Product/object blending that fixes perspective and lighting so the subject melts into a new background via the ...

RefsSeedStepsFormatSafetyLoRA

Community1.8K runs

krea/krea-2-medium

Foundation image model from Krea, tuned for expressive illustration, anime, and painterly styles. Fast and consistent ac...

RefsSeed

Community1.7K runs

philz1337x/crystal-video-upscaler

High-precision video upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity A...

Scale

Community1.7K runs

fofr/flux-neo-1x

Flux lora, fine tuned on NEO-1X robot, use "NEO1X" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.7K runs

luma/ray-3.2

Luma's reasoning video model. Generates cinematic 5s or 10s video from text or images, with native HDR and EXR export fo...

DurationImg2VidLoop

Community1.6K runs

aramintak/enna-sketch-style

A hand drawn sketch style LoRA

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.6K runs

adirik/flux-fantasy-architecture

Flux lora, use "in the style of FNTSYRCH" to trigger

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.6K runs

davisbrown/photo-glow

Flux LoRA, trigger style with "in the style of TOK", Photographic images with a beautiful glow

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.6K runs

fofr/deprecated-batch-image-captioning

A wrapper model for captioning multiple images using GPT, Claude or Gemini, useful for lora training

Community1.6K runs

ideogram-ai/ideogram-v4-quality

The highest quality Ideogram v4 model. v4 creates images with stunning realism, creative designs, and consistent styles

Community1.6K runs

bria/product-cutout

Precise AI-powered product cutout with 256-level transparency for eCommerce

Refs

Community1.5K runs

adirik/imagedream

Image-Prompt Multi-view Diffusion for 3D Generation

RefsNegSeedGuidance

Community1.5K runs

linoytsaban/flux-yarn-art

Flux lora, use "yarn art style" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.5K runs

heygen/lipsync-precision

High-accuracy lip-sync: replace or dub audio on any video with avatar-inference lip sync

Community1.5K runs

fermatresearch/spanish-f5-tts

A F5-TTS fine-tuned for Spanish

Community1.5K runs

fofr/flux-jwst

Flux fine-tuned on JWST deep space astrophotography

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.5K runs

aramintak/flux-film-foto

Flux lora in a realistic film style. Use flmft photo style to trigger the image generation.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.5K runs

gauravk95/sadtalker-video

Make your video talk anything

Community1.5K runs

kwaivgi/kling-v1.5-standard

Generate 5s and 10s videos in 720p resolution at 30fps

NegGuidanceDurationImg2Vid

Community1.4K runs

recraft-ai/recraft-v4.1-svg

Generate production-ready SVG vector images from text prompts. Recraft V4.1's design taste applied to vector output — cl...

Community1.4K runs

vidu/q3-turbo

Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with s...

SeedDurationImg2Vid

Community1.4K runs

andreasjansson/flux-goo

FLUX.1 [dev] trained on the Replicate goo

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.4K runs

lucataco/flux-syd-mead

Flux finetune trained on Syd Mead concept art for Blade Runner

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.4K runs

qunash/iphone_camera_style

Trained on iPhone photos of Tokyo. Add "shot on TOKSTYL camera" at the end of your prompts.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.4K runs

lucataco/wan-2.1-1.3b-vid2vid

Wan 2.1 1.3b Video to Video. Wan is a powerful visual generation model developed by Tongyi Lab of Alibaba Group

NegSeedGuidanceStepsFPS

Community1.3K runs

aramintak/flux-frosting-lane

Flux lora, use "frstingln illustration" to trigger the image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.3K runs

datacte/flux-synthetic-anime

Flux lora, use "1980s anime screengrab", "VHS quality", or "syntheticanime" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.3K runs

adirik/texture

Generate texture for your mesh with text prompts

SeedGuidance

Community1.3K runs

fofr/kontext-fix-jpeg-compression

Use this flux-kontext fine-tune to fix JPEG compression artifacts

RefsSeedGuidanceStepsFormatSafety

Community1.3K runs

bytedance/dolphin

Document Image Parsing via Heterogeneous Anchor Prompting

Format

Community1.3K runs

black-forest-labs/flux-2-klein-4b-base-lora

A version of FLUX.2 [klein] 4B-base that supports fast fine-tuned lora inference

SeedFormatSafetyLoRA

Community1.3K runs

genericdeag/comic-style

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.2K runs

elevenlabs/v2-multilingual

Generate multilingual text-to-speech audio in over 30 languages

Voice

Community1.2K runs

recraft-ai/recraft-v4-svg

Generate production-ready SVG vector images from text prompts. Recraft V4's design taste applied to vector output — clea...

Community1.2K runs

lucataco/csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and ...

Community1.2K runs

fishwowater/trellis2

TRELLIS.2: Native and Compact Structured Latents for 3D Generation

RefsSeed

Community1.2K runs

fofr/flux-y2k

Flux lora, use "Y2K" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.2K runs

adirik/mvdream

Generate 3D assets using text descriptions

NegSeedGuidance

Community1.1K runs

lucataco/singing_voice_conversion

Amphion Singing Voice Conversion: DiffWaveNetSVC

Community1.1K runs

zsxkib/stable-video-face-restoration

SVFR: A Unified Framework for Generalized Video Face Restoration

SeedStepsMask

Community1.1K runs

buildingwithai/ai-jo

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.1K runs

janghaludu/kocchaga

Flux finetune that is trained on Glitches generated by a Processing Script ( In Readme ), WIP.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.0K runs

fofr/flux-bad-70s-food

Flux dev lora trained on photos of 1970s food

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community1.0K runs

recraft-ai/recraft-v4.1-pro

Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4.1, with highe...

Community1.0K runs

fofr/flux-macro-texture

Flux lora, trained on macro textures, use "MCROTX" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community992 runs

cddietz/michael

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community987 runs

jayenkai/derek

Cartoon Derek is happy, sometimes

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community979 runs

sync/react-1

Realistic lipsync with refined human emotion capabilities

Temp

Community975 runs

bria/product-packshot

Transform any product photo into professional 2000x2000px packshots with optimal positioning

Refs

Community975 runs

tjrndll/flux-dev-woodcut-prints

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community973 runs

fofr/flux-cassette-futurism

Flux lora, use "cassette futurism" to trigger generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community957 runs

minimax/music-cover

Reimagine any song in a different style — change voice, instruments, genre, and arrangement while keeping the original m...

Community952 runs

andreasjansson/flux-shapes

Flux LoRA trained on generated images of random shapes

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community935 runs

heygen/video-agent

Turn a text prompt into a complete, polished video with AI-generated script, avatar presenter, voiceover, visuals, and e...

Voice

Community928 runs

fofr/flux-cyberpunk-typeface

Flux lora by AggravatingScree7189, use "cyberpunk typeface" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community911 runs

fofr/flux-spitting-image

Flux lora, use "spitting image caricature" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community906 runs

ugleh/flux-dev-lora-dixit

Flux LoRA, use 'DIXIT' to trigger generation, creates images with the art style of the board game DIXIT.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community904 runs

camenduru/lgm-ply-to-glb

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Community903 runs

$mickeybeurskens/latex-ocr$

mickeybeurskens/latex-ocr

Optical character recognition to turn images of latex equations into latex format.

Community874 runs

lucataco/split-screen-video

Combines two videos into a single split-screen layout

Community865 runs

fofr/flux-cross-section

Flux lora, use "XSEC cross section" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community829 runs

qunash/circassian-culture-flux-3000-steps

A FLUX.1 model fine-tuned on Circassian culture images

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community825 runs

marckohlbrugge/flux-wojak-v2

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community821 runs

oshtz/flux-plastic3d

flux.1 'plastic3d' style lora

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community820 runs

allthingsai69/sydneysweeney

Dream-girl Sydney Sweeney ready for your creative liberties!

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community796 runs

fermatresearch/dpo-sdxl-controlnet-lora

DPO-SDXL Canny controlnet with LoRA support.

RefsNegSeedGuidanceStepsSchedulerMultiLoRA

Community795 runs

justmalhar/flux-mobile-ui

Generate detailed mobile app user interfaces (UIs) using prefix “MOBIUI containing..” in 4-15 steps.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community781 runs

recraft-ai/recraft-v4-pro-svg

Generate detailed SVG vector graphics from text prompts. Recraft V4 Pro's design taste with more geometric detail and fi...

Community774 runs

rainer1966-de/flux_rainer

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community756 runs

fofr/flux-wrong

Flux lora, use “WRNG” to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community753 runs

bytedance/seedance-2.0-mini

A lower-cost variant of Seedance 2.0 for high-volume video generation with multimodal inputs and native audio.

RefsSeedDurationImg2Vid

Community749 runs

fofr/kontext-0_1-webp

No description available

RefsSeedGuidanceStepsFormatSafety

Community746 runs

darionaviar/dario_naviar

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community727 runs

tokaito14/fullbody

Model

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community725 runs

jamorphy/moebius-flux-lora

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community723 runs

cdreier/golang-gopher-flux

a fine tuned gopher flux LoRA - the trigger word is GOGOPH

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community701 runs

nateraw/autotune

pitch correction on your voice

FormatScale

Community692 runs

alibaba/happyhorse-1.1

Alibaba's Happy Horse 1.1 generates videos from text, animates a single image, or builds a video from multiple reference...

SeedDuration

Community684 runs

ori299/mc-thumbnails-v1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community683 runs

prunaai/vace-1.3b

This is VACE-1.3B model optimised with pruna ai. Wan2.1 VACE is an all-in-one model for video creation and editing.

SeedSteps

Community678 runs

sourceful/riverflow-v2.5-fast

Speed-optimized variant of Riverflow 2.5 for production and latency-sensitive workflows

Format

Community671 runs

cuuupid/qwen2-vl-2b

SOTA open-source model for chatting with videos and the newest model in the Qwen family

W/HTemp

Community659 runs

ideogram-ai/ideogram-v4-balanced

Balance speed, quality and cost. Ideogram v4 creates images with stunning realism, creative designs, and consistent styl...

Community656 runs

flux-kontext-apps/restyle-video-frame

Use flux-kontext-pro to change the first or last frame of a video. Useful to use as inputs for restyling an entire video...

SeedFormat

Community647 runs

andreasjansson/flux-me

flux trained on me

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community642 runs

anthropic/claude-sonnet-5

Anthropic's most agentic Sonnet model, bringing frontier-level coding and tool use at Sonnet's speed and price

Refs

Community619 runs

nicolas7894/flux-undraw

Undraw Illustration Generator

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community610 runs

intelligent-utilities/html-to-image

No description available

W/H

Community582 runs

heygen/lipsync-speed

Fast lip-sync: replace or dub audio on any video with quick audio-driven lip sync

Community581 runs

tahercoolguy/video_background_remover_appender

Remove Background of video and add yours

FPS

Community579 runs

tokaito14/portrait

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community574 runs

lucataco/pheme

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

TempVoice

Community573 runs

krea/krea-2-large

Krea's flagship foundation image model. Larger and more flexible than Krea 2 Medium, with particular strength in photore...

RefsSeed

Community562 runs

lucataco/deep3d

Deep3D: Real-Time end-to-end 2D-to-3D Video Conversion, based on deep learning

Community558 runs

fofr/kontext-long-exposure-for-water

Edit photos of water to be a long exposure using this kontext fine-tune

RefsSeedGuidanceStepsFormatSafety

Community545 runs

platform-kit/mars5-tts

A novel speech model for insane prosody.

Temp

Community541 runs

mandelavybe/jungkook

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community524 runs

heygen/avatar-iv

Create realistic talking avatar videos from text with HeyGen's Avatar IV engine

W/HVoice

Community508 runs

recraft-ai/recraft-v4.1-pro-svg

Generate detailed SVG vector graphics from text prompts. Recraft V4.1 Pro's design taste with more geometric detail and ...

Community508 runs

digitaljohn/urban-narrative

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community506 runs

andreasjansson/wan-1.3b-inpaint

Inpainting and video2video experiments with Wan 2.1

NegSeedFPS

Community498 runs

codingfu/bayc

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community496 runs

evtn/pouyippie

A flux.1 fine-tune trained on pictures of a pou plush toy. Trigger is "pouyippie"

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community476 runs

juliananev/frutiger-aero

Create your very own iconic frutiger aero inspired images!

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community472 runs

sourceful/riverflow-v2.5-pro

Top-quality agentic image model with multi-step reasoning, candidate scoring, and adjustable thinking effort

Format

Community468 runs

dcamsdev/klm-lora-flux

Creates realistic KLM flight attendants

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community467 runs

arcanite24/animation2k-flux

Flux lora inspired by early 2000s animation movies, use 1.2 guidance

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community465 runs

fofr/qwen-fantasy-art

Qwen fine-tuned on fantasy art

NegSeedGuidanceStepsFormatW/HLoRA

Community464 runs

fofr/qwen-midjourney-v3

Qwen fine-tuned on Midjourney v3 images

NegSeedGuidanceStepsFormatW/HLoRA

Community460 runs

aramintak/linnea-flux-beta

An original character LoRA

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community453 runs

nicknaskida/whisper-diarization

⚡️ Insanely Fast audio transcription | whisper large-v3 | speaker diarization | word & sentence level timestamps | promp...

Community451 runs

fofr/flux-pixar-cars

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community448 runs

adirik/mvdream-multi-view

Multi-view image generation with MVDream

Community445 runs

oshtz/flux-celpast

flux.1 'celestial pastel' lora

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community438 runs

shapestudio/nihon-flux

Fine tuned Lora Japanese style reference. Defaults to traditional red, grey and blue.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community429 runs

valllllex/cartoonia_3d

Restyling the classic cartoon and comic book characters into a soft painterly 3D style. FLUX Kontext Lora

RefsSeedGuidanceStepsFormatSafety

Community427 runs

chiasabah/flux-pepe

flux-pepe(Prompt-Enabled Picture Evolution) generates fictional internet parody characters.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community418 runs

emunozhern/fluxtok

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community417 runs

sourceful/riverflow-2.0-refsr

Render product images with 100% accuracy and environmental blending

Refs

Community401 runs

recraft-ai/recraft-v4.1-utility

A faster, lighter Recraft image generation model optimized for high-volume and production pipelines. Same design taste a...

Community401 runs

ferunelli/chadmeme

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community399 runs

jiht76/mindyflux

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community393 runs

ckizer/ckizer-64

This model generates photo portraits of Court Kizer. Use the trigger word "ckizer" in your prompt. EX: "A professional p...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community388 runs

adirik/text2tex

[Non-commercial] Generate texture for 3D assets using text descriptions

NegSeed

Community388 runs

zeke/tarot-flux

This model is not great. Check out this other one that's better: https://replicate.com/apolinario/flux-tarot-v1

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community365 runs

recraft-ai/recraft-v4.1-utility-pro

A faster, lighter Recraft image generation model at ~2048px resolution, optimized for high-volume production. Design tas...

Community365 runs

jan890/flux_dev_ascii_art

This AI ASCII Image Generator converts text prompts into detailed ASCII art images. Just include the trigger word "ASCII...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community355 runs

elevenlabs/flash-v2.5

ElevenLabs's fastest speech synthesis model

Voice

Community351 runs

qwen/qwen-image-lora-trainer-legacy

Fine-tunable Qwen Image model with exceptional composition abilities - train custom LoRAs for any style or subject

NegSeedGuidanceStepsFormatW/HLoRA

Community351 runs

ctrimm/backyard-sports-character-creator

Creates Outputs in the Style of Backyard Sports Characters

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community343 runs

sylvesteraswin/sylvester-flux-selfie

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community332 runs

s76354m/fluxfinetune

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community328 runs

deepfates/flux-hot-zuck

A fine-tuned FLUX.1 model. Use trigger word "ZUCK". Created with ReFlux (https://reflux.replicate.dev).

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community324 runs

lucataco/flux-queso

A Flux LoRA trained on photos of Jake's dog: Queso

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community320 runs

lucataco/minicpm-v-4

MiniCPM-V 4.0 has strong image and video understanding performance

Refs

Community320 runs

lisandroe/loralisandro

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community316 runs

scmdr/ai_graphic

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community312 runs

jfobrien29/flux-us-national-parks

Generate photos like old school US National Park Posters

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community310 runs

aj1357/ai-tshirt-mockup

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community306 runs

advoworks/lokidog

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community304 runs

fiolinuda/fiona

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community302 runs

carlosperz88/teslacyberbeast

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community298 runs

okostadsjunior/ninicollyloli

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community294 runs

garyvoo/person2_new

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community293 runs

fofr/flux-tessellate

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community290 runs

alexvoorheesmusic/yanisse

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community282 runs

cjwbw/canary-1b

Nvidia Automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish)

Community277 runs

rerm06/flux-dev-lora-rene

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community275 runs

bria/video-increase-resolution

Upscale videos up to 8K output resolution. Trained on fully licensed and commercially safe data.

Community272 runs

koratkar/miyazaki-watercolor

Hayao Miyazaki watercolor sketch LoRA

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community268 runs

prunaai/p-image-trainer

Fast LoRA trainer for p-image, a super fast text-to-image model developed by Pruna AI. Use LoRAs here: https://replicate...

Steps

Community267 runs

shapestudio/wild-flux

Edward Hopper(realist painter) inspired Flux Lora

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community263 runs

fofr/flux-beyond-horizon

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community258 runs

reve/reve-2.1

Generate and edit images from text and reference images with Reve 2.1

Refs

Community247 runs

runwayml/aleph-2

Edit one frame to update an entire video. Aleph 2.0 is Runway's in-context video editor: longer clips (up to 30s), multi...

Seed

Community246 runs

lipex157zoi/lipexbradok

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community245 runs

igorriti/flux-fileteado

Fileteado porteño style for Flux

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community238 runs

roberthein/modelname-new

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community233 runs

mofleck/arcodeltriunfosjv1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community231 runs

bonapartee/thumbnail-generator

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community230 runs

wowohello/fat_funjin

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community229 runs

geobotsar81/geobotsar-flux-2

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community225 runs

openai/gpt-5.6-sol

OpenAI's GPT-5.6 flagship tier, built for complex professional work, coding, and deep multi-step reasoning.

Refs

Community222 runs

adirik/udop-large

Performs document image classification, document parsing and document visual question answering

Community220 runs

tokaito14/art

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community220 runs

szdavid11/ola-scooter

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community217 runs

shapestudio/portra-800-flux

Flux Lora inspired by Kodak Portra 800

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community215 runs

witzelfitz/gardenflux

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community213 runs

aperture-2/hido

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community212 runs

julienhypeer/julienetoke

Model fine tuned avec des photos de moi avec casquette, prises de face du dossier "Shooting miniatures décembre 2023"

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community211 runs

topazlabs/image-colorization

Image colorization model from Topaz Labs

RefsFormat

Community209 runs

idvorkin/idvorkin-flux-lora-1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community207 runs

lucataco/qwen-davinci

Qwen-image fine-tuned on Drawings by Leonardo da Vinci

NegSeedGuidanceStepsFormatW/HLoRA

Community196 runs

ccchot-osk103/pai_qwen_21102568

a young lady name Pai

NegSeedGuidanceStepsFormatW/HLoRA

Community196 runs

prestoncreed/flux-spongebob

Fine-Tuned on spongebob environment/aesthetic images. Use "in the SPONGE world" to get best results.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community195 runs

jarvis-labs2024/console_cowboy_flux

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community188 runs

topazlabs/dust-and-scratch-v2

Remove dust and scratches from old photos

RefsFormat

Community188 runs

benj-edwards/dads-uppercase-2500-steps

A recreation of my dad's uppercase engineer-style handwriting.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community188 runs

sisyphos55/adsgenerator

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community187 runs

dunaevai135/tst_svt3

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community180 runs

someone12dd/muskoil1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community179 runs

lucataco/bulk-video-caption

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini

Community179 runs

fofr/flux-myst

Flux lora based on the original Myst video game

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community176 runs

okturan/flux-yesilcam

Creates images similar to scenes found in Yeşilçam movies

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community174 runs

paulbasic/flux_paulh

a private model to play around and create fun

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community170 runs

lucataco/video-split

Simple tool to split apart a video into snippets

Community169 runs

kshitijagrwl/pii-extractor-llm

PII Data Extraction from Text

Community168 runs

lucataco/kontext-meta-cars

Change your car into a CDMX Meta Car

RefsSeedGuidanceStepsFormatSafety

Community168 runs

ludocomito/flux-caravaggio

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community166 runs

derrrick/flux-70s-bands

lora for 70s band aesthetics

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community165 runs

fofr/flux-hyundai-n-vision-74

Flux lora, use "Hyundai N Vision 74 car" to trigger image generation

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community164 runs

zzyjay/icedevsfm

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community164 runs

ah77ac/majestyrabbit-flux-lora

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community159 runs

0xdeadd/flux-your-model-name

A fine-tuned FLUX.1 model

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community159 runs

fesilva196/lipex

A man

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community157 runs

jakedahn/flux-soviet-controlrooms

Flux.1 fine-tune on soviet-era controlrooms

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community155 runs

dunaevai135/tst2_d

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community154 runs

markbland82/mjbstyle1

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community152 runs

sigil-wen/laika-flux-lora

A FLUX.1 [dev] LoRA trained on Laika, Julia's dog

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community151 runs

lucataco/apollo-3b

Apollo 3B - An Exploration of Video Understanding in Large Multimodal Models

Temp

Community150 runs

joshelgar/rssmurryflux

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community146 runs

gajendrajha09/fluxloramimi

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community140 runs

szdavid11/kellogs-chocos

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community139 runs

bria/video-erase-object

A high-fidelity capability for erasing unwanted objects, people, or visual elements from videos while maintaining aesthe...

Community137 runs

shridharathi/ghibli-vid

Make a video of anything in Studio Ghibli style

RefsNegSeedSteps

Community137 runs

s-clementc/manon

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community132 runs

tylerbishopdev/retylerv2

Another Tyler; custom to your liking

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community132 runs

heygen/avatar-v

Create realistic talking avatar videos from text with HeyGen's Avatar V engine — the newest, highest-quality avatar engi...

Voice

Community130 runs

melaesse/ginevra

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community128 runs

wuzoobia/bruna-portrait

Create stunning corporate portraits and executive headshots with photorealistic quality

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community125 runs

hartmamt/flux-chicago-firesidebowl

Flux lora for scenes from famous Fireside Bowl, use "FRSBWL" to trigger

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community123 runs

jarvis-labs2024/flux-appleseed

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community121 runs

ibm-granite/granite-embedding-small-english-r2

Granite-embedding-small-english-r2 is a 47M parameter dense biencoder embedding model from the Granite Embeddings collec...

Community119 runs

decart/lucy-edit-2

Edit and transform videos with text prompts and reference images. Style transfers, object replacement, character transfo...

RefsSeed

Community117 runs

abyssalsoul/cyberdad

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community116 runs

therendercafe/therendercafegmailcom-sarahi-2595

A fine-tuned FLUX.1 model

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community116 runs

lucataco/vid2webp

Convert your video into webp format (with looping)

LoopFPS

Community115 runs

dfrostar/hermes_kellybag

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community114 runs

urieltac/polar-fleece

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community114 runs

jarvis-labs2024/flux-raylene

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community112 runs

sevensevenimages/rolxsub

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community108 runs

cleexiang/jellycat

Flux Lora, use "jellycat toy" to trigger style like jellycat toy.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community108 runs

levelsio/brain-flakes-ai

Generate your own Brain Flakes with AI

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community105 runs

fofr/qwen-my-subconscious

Qwen fine-tuned on trippy and vibrant FLUX Pro outputs

NegSeedGuidanceStepsFormatW/HLoRA

Community104 runs

chatworks/luchtpods

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community102 runs

someone12dd/libreintense

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community101 runs

cristobalascencio/florentino

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community97 runs

ibm-granite/granite-speech-4.1-2b

Granite Speech 4.1 2B is a compact and efficient speech-language model, specifically designed for multilingual automatic...

SeedTemp

Community96 runs

dunaevai135/tst_trn

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community94 runs

uthana/text-to-motion-diffusion-v2

Generate 3D character animation data from a text prompt

SeedGuidanceFPS

Community92 runs

fofr/flux-brighton-west-pier

Flux lora, trigger image generation with "west pier"

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community91 runs

popovluka/luka-flux-lora-model

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community90 runs

maikocode/ascii-style

Turn image into ASCII style. If you like this LoRA visit my app: YourAIPhotographer.com

RefsSeedGuidanceStepsFormatSafety

Community90 runs

bria/fibo-edit

FIBO-Edit brings the power of structured prompt generation to image editing

RefsNegSeedGuidanceMask

Community86 runs

warf23/agrat_me

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community86 runs

10xcrazyhorse/bonkfa

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community83 runs

tokaito14/laura

Model

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community81 runs

jcbianchi010/jeancbianchi

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community80 runs

ihor-thecoach/ihor2

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community77 runs

0xdeadd/vonda2

A fine-tuned FLUX.1 model

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community77 runs

uthana/create-character-v1

Rig any 3D bipedal character mesh

Community76 runs

damdam775/portraits_dialogues

Create RPG like expressive character portraits for in game dialogs.

RefsSeedGuidanceStepsFormatSafety

Community76 runs

enkey08/ganpatibappa

Ganpati image generation model trained on all Ganapati in Pune, India

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community75 runs

sigil-wen/pepsi-flux

FLUX.1 [dev] LoRA trained on images of my cat Pepsi

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community74 runs

banta2000/mytestmodel

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community73 runs

mintkaori/seoulgi

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community72 runs

cristobalascencio/wirra

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community72 runs

kazdatahelp/llkm

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community72 runs

mofleck/teslasemishipped

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community72 runs

shridharathi/blueprint-qwen

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community71 runs

fofr/qwen-2004

Qwen fine-tuned on bad photos from 2004

NegSeedGuidanceStepsFormatW/HLoRA

Community70 runs

puribogdan/puri

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community70 runs

warf23/ai_glasses

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community70 runs

marcusschwarze/paul-mittelrheintaler

a public experiment with the AI generated figure of Paul Mittelrheintaler, a cool YouTuber in the middle Rhine Valley. C...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community69 runs

yuezheng2006/flux-lora-gyy

A Fulx1.D based lora for gaoyuanyuan,trigger word "gyy"

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community58 runs

davidkwcheng/jessicayu02

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community58 runs

esgfit/flux_andreas

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community58 runs

alekseycalvin/acsoonr_flux

ACS token

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community57 runs

grassyguru/archslra

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community57 runs

imiroslav/ai_skoda_octavia

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community57 runs

pbevan1/llama-3.1-8b-ocr-correction

LLaMA 3.1-8B, finetuned on a synthetic OCR dataset for superior OCR correction.

Community57 runs

titouv/test_model3

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community56 runs

anon987654321/ra2

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community56 runs

fofr/qwen-bad-70s-food

Qwen fine-tuned on photos of bad 70s food

NegSeedGuidanceStepsFormatW/HLoRA

Community55 runs

maybasmanphotographer/bmayb

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community55 runs

ugleh/flux-dev-lora-painterdetective

Flux LoRA, use 'PRVDETC' to trigger generation, creates characters based on the art style of the board game Painter Dete...

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community54 runs

ludocomito/flux-kuji

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community54 runs

ummtushar/pilot

A fine-tuned FLUX.1 model v1.1

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community52 runs

arthuryeti/dwiss-qwen-2

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community51 runs

astronautdavid1/mad2-flux-lora

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community50 runs

saysenbl/lisflux

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community49 runs

andreaxricci/maradona

model fine-tuned with pictures of Maradona, the best soccer player of all times :)

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community47 runs

dunaevai135/flux_lora_fungus

use TOK for rhizomorphic mycelium

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community47 runs

maffuw/scrub-runners2

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community46 runs

uthana/text-to-motion-vqvae-v1

Generate 3D character animation data from a text prompt

FPS

Community46 runs

felixwky-code/sateur-lora

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community44 runs

roadmaus/stillsuit

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community44 runs

mckunkel/alexs

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community43 runs

siamakf/petpatrol-joyi

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community42 runs

guokai34370298/andy

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community39 runs

maffuw/shakira

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community39 runs

juddisjudd/ckh-flux-model

Lora of myself

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community37 runs

matsnl65/langstonrichardson

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community37 runs

justinwkukm/a_photo_of_wko

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community37 runs

gaby94500/mamav

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community36 runs

fofr/qwen-dark-art

Qwen fine-tuned on classical dark artwork

NegSeedGuidanceStepsFormatW/HLoRA

Community32 runs

leosy-kingdom/leosy-earth2

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community31 runs

fofr/qwen-william-blake

Qwen fine-tuned on the art of William Blake

NegSeedGuidanceStepsFormatW/HLoRA

Community31 runs

mike-freeai/hx8y

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community30 runs

ameureka/amazing

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community28 runs

paullarin671/lawther

Flux lora of Alex Lawther, trained on 20 images. Trigger word:"lawther"

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community28 runs

maubad/calaverasconllamas

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community27 runs

juddisjudd/deriksen

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community26 runs

zeke/ziki-2024-08-30

Use trigger word ziki-2024-08-30 to activate the trained LoRA style.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community25 runs

mike-gbenlandia/nigerian

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community23 runs

ugleh/flux-dev-lora-fallguy2

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community23 runs

joshelgar/andrwfrclgh

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community21 runs

kazdatahelp/tql

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community20 runs

jimmywong974/yoshi

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community19 runs

fofr/qwen-black-sclera

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community19 runs

rsjejonathanortiz/jonathan_heroes

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community18 runs

juddisjudd/barricadettv

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community17 runs

fuskio64/eugenito

Show a nice guy if you use the keyword Eugenito

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community15 runs

johnseepps/hoodicon

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community15 runs

qwen/qwen3-7-plus

Qwen3.7-Plus is Alibaba's cost-effective multimodal model with vision-language understanding, a 1 million token context ...

RefsTemp

Community14 runs

juddisjudd/kevine

No description available

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community13 runs

fofr/qwen-n74

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community13 runs

fofr/qwen-tron-ares

Qwen fine-tuned on TRON: ARES trailer

NegSeedGuidanceStepsFormatW/HLoRA

Community11 runs

openai/gpt-5.6-terra

OpenAI's GPT-5.6 balanced tier, tuned for everyday production work at roughly half the cost of the flagship.

Refs

Community10 runs

brucecris/bruce1

The first Bruce model.

RefsSeedGuidanceStepsFormatW/HSafetyMultiMaskLoRA

Community10 runs

yosun/camcorgi-qwern

a qwen lora that knows that CAM corgi looks like... https://replicate.delivery/xezq/Qeh4prhrfEjLOEoB2kgdzS98yArm2SoruIPG...

NegSeedGuidanceStepsFormatW/HLoRA

Community9 runs

ccchot-osk103/happycat

No description available

NegSeedGuidanceStepsFormatW/HLoRA

Community9 runs

ccchot-osk103/buacat

the grey tabby cat

NegSeedGuidanceStepsFormatW/HLoRA

Community8 runs

openai/gpt-5.6-luna

OpenAI's GPT-5.6 cost-optimized tier, built for fast, high-volume, latency-sensitive workloads.

Refs

Community7 runs

ccchot-osk103/moneycat

a cute and young orange British Shorthair cat

NegSeedGuidanceStepsFormatW/HLoRA

Community5 runs