InstantID Open-Source Model - Achieve Identity-Preserving Generation with a Single Image, Support Diverse Downstream Tasks!

InstantID

Developed by InstantX

InstantID is an advanced, tuning-free method that achieves identity-preserving generation with just a single image, supporting multiple downstream tasks.

Category: Image Generation
Language: English
Open Source License: Apache-2.0
Tags: #Zero-shot Identity Preservation #Single Image Generation #Face Embedding

Downloads 86.99k

Release Time: 1/19/2024

Model Overview

InstantID is an innovative identity-preserving generation model capable of generating new images that retain identity features from a single reference image, suitable for various image generation tasks.

Model Features

No Tuning Required

Achieves identity-preserving generation with just a single image, no additional training or tuning needed.

Identity Preservation

Accurately captures and maintains identity features from the reference image.

Multi-task Support

Supports various downstream image generation tasks such as style transfer, age transformation, etc.

Fast Generation

Capable of producing high-quality images in just a few seconds.

Model Capabilities

Identity-preserving Image Generation

Style Transfer

Age Transformation Simulation

Expression Change

Artistic Style Transfer

Use Cases

Creative Design

Artistic Portrait Creation

Transforms ordinary photos into portraits of various artistic styles.

Generates portraits with artistic styles while preserving the subject's identity.

Entertainment Applications

Age Transformation Simulation

Simulates a person's appearance at different age stages.

Generates realistic and believable portraits across different age groups.

🚀 InstantID Model Card

InstantID is a cutting-edge, tuning-free method for ID-preserving generation from just a single image. It supports a variety of downstream tasks in AI-driven image generation.

🚀 Quick Start

✨ Features

A state-of-the-art, tuning-free approach for ID-preserving generation.
Only requires a single image for generation.
Supports multiple downstream tasks.

📦 Installation

You can download the model directly from this repository, or fetch it from a Python script:

from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="InstantX/InstantID", filename="ControlNetModel/config.json", local_dir="./checkpoints")
hf_hub_download(repo_id="InstantX/InstantID", filename="ControlNetModel/diffusion_pytorch_model.safetensors", local_dir="./checkpoints")
hf_hub_download(repo_id="InstantX/InstantID", filename="ip-adapter.bin", local_dir="./checkpoints")

The face encoder must be downloaded manually via this URL and placed in models/antelopev2.
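Before loading the pipeline, it can help to confirm that the downloads landed where the usage example expects them. The following is a minimal sketch, not part of the InstantID code: the helper name `missing_files` is hypothetical, and it assumes the face encoder is stored as one or more `.onnx` files inside `models/antelopev2`.

```python
from pathlib import Path

# Files fetched by the hf_hub_download calls above, relative to ./checkpoints.
EXPECTED_CHECKPOINTS = [
    "ControlNetModel/config.json",
    "ControlNetModel/diffusion_pytorch_model.safetensors",
    "ip-adapter.bin",
]


def missing_files(checkpoint_dir="./checkpoints", face_dir="./models/antelopev2"):
    """Return a list of problems; an empty list means the layout looks complete."""
    problems = []
    for rel in EXPECTED_CHECKPOINTS:
        if not (Path(checkpoint_dir) / rel).is_file():
            problems.append(f"missing checkpoint: {rel}")
    # Assumption: the face encoder ships as .onnx files in this directory.
    face_path = Path(face_dir)
    if not face_path.is_dir() or not any(face_path.glob("*.onnx")):
        problems.append(f"no .onnx files found in {face_dir}")
    return problems


if __name__ == "__main__":
    for problem in missing_files():
        print(problem)
```

Running the script from the project root prints one line per missing item and nothing when everything is in place.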

💻 Usage Examples

Basic Usage

# !pip install opencv-python transformers accelerate insightface
import diffusers
from diffusers.utils import load_image
from diffusers.models import ControlNetModel

import cv2
import torch
import numpy as np
from PIL import Image

from insightface.app import FaceAnalysis
from pipeline_stable_diffusion_xl_instantid import StableDiffusionXLInstantIDPipeline, draw_kps

# prepare 'antelopev2' under ./models
app = FaceAnalysis(name='antelopev2', root='./', providers=['CUDAExecutionProvider', 'CPUExecutionProvider'])
app.prepare(ctx_id=0, det_size=(640, 640))

# prepare models under ./checkpoints (plain strings; no f-string needed)
face_adapter = './checkpoints/ip-adapter.bin'
controlnet_path = './checkpoints/ControlNetModel'

# load IdentityNet
controlnet = ControlNetModel.from_pretrained(controlnet_path, torch_dtype=torch.float16)

pipe = StableDiffusionXLInstantIDPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", controlnet=controlnet, torch_dtype=torch.float16
)
pipe.cuda()

# load adapter
pipe.load_ip_adapter_instantid(face_adapter)

Advanced Usage

# load the reference face image
face_image = load_image("your-example.jpg")

# prepare face embedding (insightface expects a BGR array)
face_info = app.get(cv2.cvtColor(np.array(face_image), cv2.COLOR_RGB2BGR))
# keep only the largest face, ranked by bounding-box area (width * height)
face_info = sorted(face_info, key=lambda x: (x['bbox'][2] - x['bbox'][0]) * (x['bbox'][3] - x['bbox'][1]))[-1]
face_emb = face_info['embedding']
face_kps = draw_kps(face_image, face_info['kps'])

pipe.set_ip_adapter_scale(0.8)

prompt = "analog film photo of a man. faded film, desaturated, 35mm photo, grainy, vignette, vintage, Kodachrome, Lomography, stained, highly detailed, found footage, masterpiece, best quality"
negative_prompt = "(lowres, low quality, worst quality:1.2), (text:1.2), watermark, painting, drawing, illustration, glitch, deformed, mutated, cross-eyed, ugly, disfigured"

# generate image
image = pipe(
    prompt,
    negative_prompt=negative_prompt,
    image_embeds=face_emb,
    image=face_kps,
    controlnet_conditioning_scale=0.8,
).images[0]

For more details, please follow the instructions in our GitHub repository.
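The "only use the maximum face" step in the example above is meant to pick the detection with the largest bounding box. The sketch below isolates that step on made-up data so it can be checked without a GPU or any model files. The helper names `bbox_area` and `largest_face` are hypothetical, and the code assumes each detection exposes `bbox` as `(x1, y1, x2, y2)`.

```python
def bbox_area(face):
    """Area of a detection whose 'bbox' is (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = face["bbox"]
    return (x2 - x1) * (y2 - y1)


def largest_face(faces):
    """Return the detection with the largest bounding-box area."""
    if not faces:
        raise ValueError("no face detected in the reference image")
    return max(faces, key=bbox_area)


# Made-up detections for illustration only.
faces = [
    {"name": "background", "bbox": (10, 10, 60, 70)},   # 50 x 60 = 3000
    {"name": "subject", "bbox": (100, 50, 300, 350)},   # 200 x 300 = 60000
]
print(largest_face(faces)["name"])  # prints: subject
```

Raising an error on an empty list makes the "no face found" case explicit instead of failing later with an index error.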

💡 Usage Tip

If you're not satisfied with the similarity, try increasing the "IdentityNet Strength" and "Adapter Strength" weights.
If the saturation is too high, first decrease the Adapter Strength. If it is still too high, then decrease the IdentityNet Strength.
If text control is not working as expected, decrease the Adapter Strength.
If the realistic style is not good enough, see our GitHub repository and use a more realistic base model.
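The tips above can be read as small adjustments to two numbers. The sketch below encodes them as a helper, under several assumptions that do not come from the model card: that "IdentityNet Strength" corresponds to `controlnet_conditioning_scale` and "Adapter Strength" to the value given to `pipe.set_ip_adapter_scale`, that a step of 0.1 is sensible, and that values are kept within 0.0 to 1.5. The symptom names and the `Strengths` class are hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Strengths:
    identitynet: float = 0.8  # assumed to map to controlnet_conditioning_scale
    adapter: float = 0.8      # assumed to map to pipe.set_ip_adapter_scale(...)


def _clamp(value, low=0.0, high=1.5):
    """Keep a strength inside an assumed valid range and tidy float noise."""
    return round(min(max(value, low), high), 2)


def adjust(strengths, symptom, step=0.1):
    """Return new strengths following the usage tips for a given symptom."""
    if symptom == "low_similarity":
        # Tip 1: raise both strengths.
        return Strengths(_clamp(strengths.identitynet + step),
                         _clamp(strengths.adapter + step))
    if symptom in ("oversaturated", "weak_text_control"):
        # Tips 2 and 3: lower the adapter strength first.
        return replace(strengths, adapter=_clamp(strengths.adapter - step))
    if symptom == "still_oversaturated":
        # Tip 2, second stage: lower the IdentityNet strength.
        return replace(strengths, identitynet=_clamp(strengths.identitynet - step))
    raise ValueError(f"unknown symptom: {symptom}")


if __name__ == "__main__":
    print(adjust(Strengths(), "low_similarity"))
```

Under these assumptions, the returned values would be applied as `pipe.set_ip_adapter_scale(s.adapter)` and `controlnet_conditioning_scale=s.identitynet` before regenerating.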

📄 License

This project is released under the Apache License and aims to positively impact the field of AI-driven image generation. Users are free to create images with this tool, but they must comply with local laws and use it responsibly. The developers assume no responsibility for potential misuse by users.

📚 Citation

@article{wang2024instantid,
  title={InstantID: Zero-shot Identity-Preserving Generation in Seconds},
  author={Wang, Qixun and Bai, Xu and Wang, Haofan and Qin, Zekui and Chen, Anthony},
  journal={arXiv preprint arXiv:2401.07519},
  year={2024}
}
