FLUX.1-dev-ControlNet-Union-Pro-2.0开源模型 - 支持多控制模式，提升美学表现

首页

FLUX.1 Dev ControlNet Union Pro 2.0

由 Shakker-Labs 开发

基于FLUX.1-dev模型的统一ControlNet，支持多种控制模式，改进控制效果和美学表现

图像生成英语开源协议:其他 #多条件控制 #美学增强 #图像生成优化

下载量 20.40k

发布时间 : 4/14/2025

模型简介

该模型是一个用于文本生成图像任务的ControlNet，支持canny、soft edge、depth、pose、gray等多种控制模式，可与FLUX.1-dev基础模型配合使用生成高质量图像。

模型特点

多控制模式支持

支持canny、soft edge、depth、pose、gray等多种控制模式

改进的控制效果

相比前代版本，改进了canny和pose的控制效果和美学表现

体积优化

移除了模式嵌入，模型体积更小

多重控制支持

可与其他ControlNet联合使用，实现多重条件控制

模型能力

文本生成图像

图像条件控制

多条件联合控制

使用案例

创意设计

人像生成

根据姿势图生成高质量人像

生成符合姿势要求且具有美学价值的人像

场景生成

根据深度图生成3D场景

生成符合深度关系的逼真场景

艺术创作

艺术风格转换

根据边缘图生成不同艺术风格的图像

保持原始结构的同时应用不同艺术风格

🚀 FLUX.1-dev-ControlNet-Union-Pro-2.0

本仓库包含由 Shakker Labs 发布的用于 FLUX.1-dev 模型的统一 ControlNet。我们提供了一个在线演示。社区提供的 FP8 量化版本可在 ABDALLALSWAITI/FLUX.1-dev-ControlNet-Union-Pro-2.0-fp8 中找到。

✨ 主要特性

与 Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro 相比：

移除了模式嵌入，模型尺寸更小。
在 Canny 和姿态控制方面有所改进，控制效果和美学表现更佳。
增加了对软边缘的支持，移除了对平铺模式的支持。

📚 详细文档

模型卡片

此 ControlNet 由 6 个双块和 0 个单块组成，移除了模式嵌入。
我们使用包含 2000 万张高质量通用和人物图像的数据集，从零开始对模型进行了 30 万步的训练。训练分辨率为 512x512，使用 BFloat16 格式，批量大小为 128，学习率为 2e-5，引导系数从 [1, 7] 中均匀采样。我们将文本丢弃率设置为 0.20。
该模型支持多种控制模式，包括 Canny、软边缘、深度、姿态和灰度。您可以像使用普通的 ControlNet 一样使用它。
此模型可以与其他 ControlNet 联合使用。

展示示例

控制模式	示例图片
Canny
软边缘
姿态
深度
灰度

💻 使用示例

基础用法

import torch
from diffusers.utils import load_image
from diffusers import FluxControlNetModel, FluxControlNetPipeline

base_model = 'black-forest-labs/FLUX.1-dev'
controlnet_model_union = 'Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro-2.0'

controlnet = FluxControlNetModel.from_pretrained(controlnet_model_union, torch_dtype=torch.bfloat16)
pipe = FluxControlNetPipeline.from_pretrained(base_model, controlnet=controlnet, torch_dtype=torch.bfloat16)
pipe.to("cuda")

# 替换为其他条件图像
control_image = load_image("./conds/canny.png")
width, height = control_image.size

prompt = "A young girl stands gracefully at the edge of a serene beach, her long, flowing hair gently tousled by the sea breeze. She wears a soft, pastel-colored dress that complements the tranquil blues and greens of the coastal scenery. The golden hues of the setting sun cast a warm glow on her face, highlighting her serene expression. The background features a vast, azure ocean with gentle waves lapping at the shore, surrounded by distant cliffs and a clear, cloudless sky. The composition emphasizes the girl's serene presence amidst the natural beauty, with a balanced blend of warm and cool tones."

image = pipe(
    prompt, 
    control_image=control_image,
    width=width,
    height=height,
    controlnet_conditioning_scale=0.7,
    control_guidance_end=0.8,
    num_inference_steps=30, 
    guidance_scale=3.5,
    generator=torch.Generator(device="cuda").manual_seed(42),
).images[0]

高级用法

import torch
from diffusers.utils import load_image

# https://github.com/huggingface/diffusers/pull/11350
# 您可以通过从源代码安装最新版本直接从 diffusers 导入
# from diffusers import FluxControlNetPipeline, FluxControlNetModel

# 目前使用本地文件
from pipeline_flux_controlnet import FluxControlNetPipeline
from controlnet_flux import FluxControlNetModel

base_model = 'black-forest-labs/FLUX.1-dev'
controlnet_model_union = 'Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro-2.0'

controlnet = FluxControlNetModel.from_pretrained(controlnet_model_union, torch_dtype=torch.bfloat16)
pipe = FluxControlNetPipeline.from_pretrained(base_model, controlnet=[controlnet], torch_dtype=torch.bfloat16) # 使用列表以启用多 ControlNet
pipe.to("cuda")

# 替换为其他条件图像
control_image = load_image("./conds/canny.png")
width, height = control_image.size

prompt = "A young girl stands gracefully at the edge of a serene beach, her long, flowing hair gently tousled by the sea breeze. She wears a soft, pastel-colored dress that complements the tranquil blues and greens of the coastal scenery. The golden hues of the setting sun cast a warm glow on her face, highlighting her serene expression. The background features a vast, azure ocean with gentle waves lapping at the shore, surrounded by distant cliffs and a clear, cloudless sky. The composition emphasizes the girl's serene presence amidst the natural beauty, with a balanced blend of warm and cool tones."

image = pipe(
    prompt, 
    control_image=[control_image, control_image], # 尝试使用不同的条件，如 Canny 和深度、姿态和深度
    width=width,
    height=height,
    controlnet_conditioning_scale=[0.35, 0.35],
    control_guidance_end=[0.8, 0.8],
    num_inference_steps=30, 
    guidance_scale=3.5,
    generator=torch.Generator(device="cuda").manual_seed(42),
).images[0]

📄 许可证

本项目使用 flux-1-dev-non-commercial-license 许可证。

🔗 相关资源

🙏 致谢

📑 引用

如果您在研究中发现本项目有用，请通过以下方式引用我们：

@misc{flux-cn-union-pro-2,
    author = {Shakker-Labs},
    title = {ControlNet-Union},
    year = {2025},
    howpublished={\url{https://huggingface.co/Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro-2.0}},
}