controlnet-union-sdxl-1.0 open-source image tool - Supports 12 conditions and 5 types of editing for free creation

Controlnet Union Sdxl 1.0

Developed by xinsir

All-round image generation and editing control network, supporting 12 control conditions and 5 advanced editing functions

Image Generation Open Source License:Apache-2.0 #Multi-condition image generation #High-resolution image editing #SDXL compatibility

Downloads 156.68k

Release Time : 7/7/2024

Model Overview

ControlNet++ is an enhanced control network based on Stable Diffusion XL, supporting multiple image control conditions and advanced editing functions, capable of generating high-quality images and performing fine editing.

Model Features

Multi-condition control

Supports 12 control conditions, including various control methods such as pose, depth, and edge

Advanced editing functions

Provides 5 advanced editing functions: block deblurring, block variation, block super-resolution, image restoration, and image expansion

High-resolution support

Adopts NovelAI-style bucket training, supporting high-resolution image generation in any ratio

High-quality dataset

Trained with a high-quality dataset of over tens of millions, covering diverse scenarios

Strong compatibility

Compatible with open-source SDXL models such as BluePencilXL and CounterfeitXL, supporting various Lora models

Model Capabilities

Text-to-image

Image editing

Image restoration

Image expansion

Super-resolution

Multi-condition control

Use Cases

Creative design

Concept art creation

Generate character concept maps using pose control

Artworks with precisely controlled character poses

Scene design

Generate 3D scenes using depth map control

Scene images with precise depth information

Image processing

Image restoration

Restore images with damaged or missing parts

Completely restored images

Image super-resolution

Improve image resolution

High-definition images upgraded from 1 million pixels to 9 million pixels

Anime creation

Line drawing coloring

Generate color images using anime line drawings for control

Color anime images maintaining the style of the original line drawings

🚀 ControlNet++: All-in-one ControlNet for image generations and editing!

ControlNet++ is an all - in - one solution for image generation and editing. It offers a ProMax model with 12 controls and 5 advanced editing features, enabling high - quality image output comparable to Midjourney.

🚀 Quick Start

Inference scripts and more details can be found at: https://github.com/xinsir6/ControlNetPlus/tree/main

✨ Features

ProMax Model Release

The ProMax model has been released! It comes with 12 controls and 5 advanced editing features. Just give it a try!

Visual Display

images_display

Network Architecture

images

Advantages of the Model

High - Resolution Image Generation: Utilizes bucket training similar to NovelAI, capable of generating high - resolution images of any aspect ratio.
Large - Scale High - Quality Data: Trained on a large amount of high - quality data (over 10000000 images), covering a wide range of scenarios.
Enhanced Prompt Following: Employs re - captioned prompts like DALLE.3, using CogVLM to generate detailed descriptions, resulting in excellent prompt - following ability.
Effective Training Tricks: Applies various useful tricks during training, including but not limited to data augmentation, multiple loss functions, and multi - resolution training.
Low Parameter Increase: Has almost the same number of parameters as the original ControlNet, without a significant increase in network parameters or computation.
Multiple Control Conditions: Supports 10+ control conditions, with no obvious performance drop on any single condition compared to independent training.
Multi - Condition Generation: Supports multi - condition generation, with condition fusion learned during training. No need to set hyperparameters or design complex prompts.
Compatibility: Compatible with other open - source SDXL models, such as BluePencilXL and CounterfeitXL, as well as other Lora models.

Technical Innovation

We designed a new architecture that can support 10+ control types in text - to - image generation and produce high - resolution images visually comparable to those of Midjourney. Based on the original ControlNet architecture, we proposed two new modules:

Extend the original ControlNet to support different image conditions using the same network parameters.
Enable multiple conditions input without increasing computation offload, which is crucial for designers who need detailed image editing. Different conditions share the same condition encoder, without adding extra computations or parameters. We conducted thorough experiments on SDXL and achieved superior performance in both control ability and aesthetic score. We released the method and the model to the open - source community for everyone to enjoy.

💻 Usage Examples

Advanced Editing Features in ProMax Model

Tile Deblur

blur0 blur1 blur2 blur3 blur4 blur5

Tile Variation

var0 var1 var2 var3 var4 var5

Tile Super Resolution

The following examples show the transition from 1M resolution to 9M resolution:

Image Inpainting

inp0 inp1 inp2 inp3 inp4 inp5

Image Outpainting

oup0 oup1 oup2 oup3 oup4 oup5

Visual Examples

Openpose

pose0 pose1 pose2 pose3 pose4

Depth

depth0 depth1 depth2 depth3 depth4

Canny

canny0 canny1 canny2 canny3 canny4

Lineart

lineart0 lineart1 lineart2 lineart3 lineart4

AnimeLineart

animelineart0 animelineart1 animelineart2 animelineart3 animelineart4

Mlsd

mlsd0 mlsd1 mlsd2 mlsd3 mlsd4

Scribble

scribble0 scribble1 scribble2 scribble3 scribble4

Hed

hed0 hed1 hed2 hed3 hed4

Pidi(Softedge)

pidi0 pidi1 pidi2 pidi3 pidi4

Teed

ted0 ted1 ted2 ted3 ted4

Segment

segment0 segment1 segment2 segment3 segment4

Normal

normal0 normal1 normal2 normal3 normal4

Multi - Control Visual Examples

Openpose + Canny

pose_canny0 pose_canny1 pose_canny2 pose_canny3 pose_canny4 pose_canny5

Openpose + Depth

pose_depth0 pose_depth1 pose_depth2 pose_depth3 pose_depth4 pose_depth5

Openpose + Scribble

pose_scribble0 pose_scribble1 pose_scribble2 pose_scribble3 pose_scribble4 pose_scribble5

Openpose + Normal

pose_normal0 pose_normal1 pose_normal2 pose_normal3 pose_normal4 pose_normal5

Openpose + Segment

pose_segment0 pose_segment1 pose_segment2 pose_segment3 pose_segment4 pose_segment5

📄 License

This project is licensed under the Apache - 2.0 license.

💡 Usage Tip

If you find it useful, please give me a star. Thank you very much! The SDXL ProMax version has been released. Enjoy it!

⚠️ Important Note

I'm sorry that due to the difficulty in balancing the project's revenue and expenditure, the GPU resources have been allocated to other more profitable projects. The SD3 training is stopped until I can find enough GPU support. I will try my best to find GPUs to continue training. If this causes any inconvenience, I sincerely apologize. I want to thank everyone who likes this project. Your support keeps me going.

Note: We put the promax model with a promax suffix in the same huggingface model repo. Detailed instructions will be added later.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご