LCM-SDXL Open-Source AI Model - Fast Image Inference, Generate Images in 2

Lcm Sdxl

Developed by latent-consistency

A latent consistency model based on Stable Diffusion XL, reducing inference steps to 2-8

Image Generation #Ultra-fast Text-to-Image #Few-step Inference #SDXL Optimization

Downloads 882

Release Time : 11/7/2023

Model Overview

This is a version distilled with LCM from stable-diffusion-xl-base-1.0, significantly reducing the required inference steps for image generation while maintaining high-quality output.

Model Features

Fast Inference

Through LCM distillation technology, reduces inference steps from traditional SDXL's 25-50 steps to just 2-8 steps

High-Quality Output

Maintains image quality comparable to original SDXL even with minimal inference steps

Multi-functional Support

Supports text-to-image, image-to-image, inpainting, ControlNet control, and T2I adapters

Model Capabilities

Text-to-image generation

Image-to-image translation

Image inpainting

Controlled image generation

Use Cases

Creative Design

Concept Art Creation

Quickly generate high-quality concept art images

Produces usable artwork within 4 inference steps

Commercial Applications

Advertising Material Generation

Rapidly iterate visual content for advertising creativity

Significantly reduces creative production time

🚀 Latent Consistency Model (LCM): SDXL

Latent Consistency Model (LCM) enables high - resolution image synthesis with few - step inference, offering a more efficient way for text - to - image generation.

🚀 Quick Start

Latent Consistency Model (LCM) was proposed in Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference by Simian Luo, Yiqin Tan et al. and Simian Luo, Suraj Patil, and Daniel Gu successfully applied the same approach to create LCM for SDXL.

This checkpoint is a LCM distilled version of stable-diffusion-xl-base-1.0 that allows to reduce the number of inference steps to only between 2 - 8 steps.

✨ Features

Fewer Inference Steps: Reduces the inference steps to 2 - 8 steps, significantly improving efficiency.
Multiple Modes Supported: Supports text - to - image, image - to - image, inpainting, ControlNet, and T2I Adapter modes.

📦 Installation

LCM SDXL is supported in 🤗 Hugging Face Diffusers library from version v0.23.0 onwards. To run the model, first install the latest version of the Diffusers library as well as peft, accelerate and transformers. audio dataset from the Hugging Face Hub:

pip install --upgrade pip
pip install --upgrade diffusers transformers accelerate peft

💻 Usage Examples

Basic Usage

Text - to - Image

The model can be loaded with it's base pipeline stabilityai/stable-diffusion-xl-base-1.0. Next, the scheduler needs to be changed to LCMScheduler and we can reduce the number of inference steps to just 2 to 8 steps. Please make sure to either disable guidance_scale or use values between 1.0 and 2.0.

from diffusers import UNet2DConditionModel, DiffusionPipeline, LCMScheduler
import torch

unet = UNet2DConditionModel.from_pretrained("latent-consistency/lcm-sdxl", torch_dtype=torch.float16, variant="fp16")
pipe = DiffusionPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0", unet=unet, torch_dtype=torch.float16, variant="fp16")

pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)
pipe.to("cuda")

prompt = "a close-up picture of an old man standing in the rain"

image = pipe(prompt, num_inference_steps=4, guidance_scale=8.0).images[0]

Advanced Usage

Image - to - Image

Works as well! TODO docs

Inpainting

Works as well! TODO docs

ControlNet

Works as well! TODO docs

T2I Adapter

Works as well! TODO docs

📚 Documentation

Speed Benchmark

TODO

Training

TODO

📄 License

The license for this project is openrail++.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご