controlnet_qrcode-control_v11p_sd21開源模型 - 輕鬆將二維碼融入藝術圖像

首頁

Controlnet Qrcode Control V11p Sd21

由DionTimmer開發

專為Stable Diffusion 2.1設計的二維碼條件控制網絡模型，可將二維碼融入藝術圖像中

圖像生成英語#二維碼藝術生成 #高容錯率控制 #廣告牌設計優化

下載量 60

發布時間 : 6/15/2023

模型概述

該模型是一個ControlNet模型，用於在Stable Diffusion 2.1中生成包含可掃描二維碼的藝術圖像。它能在保持藝術風格的同時確保二維碼功能正常。

模型特點

二維碼融合

能將二維碼自然地融入生成的藝術圖像中，同時保持可掃描性

高分辨率支持

推薦使用768分辨率生成，可獲得更精細的細節表現

參數可調

通過調節guidance_scale、controlnet_conditioning_scale和strength參數控制二維碼與藝術效果的平衡

多版本兼容

除了SD2.1版本外，還提供了基於相同數據集的SD1.5版本模型

模型能力

藝術二維碼生成

圖像風格轉換

二維碼與圖像融合

可控圖像生成

使用案例

商業應用

創意廣告牌

生成包含可掃描二維碼的藝術廣告牌圖像

如示例中的紐約市廣告牌，既美觀又實用

數字藝術

藝術二維碼作品

創建具有藝術風格的二維碼作品

二維碼與藝術風格自然融合，同時保持掃描功能

🚀 用於Stable Diffusion 2.1的二維碼條件ControlNet模型

本項目提供了基於二維碼條件的ControlNet模型，適用於Stable Diffusion 2.1。該模型可助力用戶在圖像生成過程中，依據二維碼條件生成特定圖像，為圖像創作帶來更多可能性。

🚀 快速開始

本模型提供了safetensors和diffusers兩種版本，適用於Stable Diffusion v2.1。Stable Diffusion 2.1版本效果稍好，是為滿足特定需求而開發的。不過，也在相同數據集上訓練了1.5版本的模型，供使用舊版本的用戶使用。

✨ 主要特性

支持Stable Diffusion 2.1和1.5版本。
可根據二維碼條件生成特定圖像。
提供safetensors和diffusers兩種版本。

📦 安裝指南

使用`diffusers`庫

pip -q install diffusers transformers accelerate torch xformers

在auto1111中使用

將.safetensors模型及其.yaml配置文件放在其他ControlNet模型的安裝文件夾中，具體位置因應用而異。在auto1111中，可將其放在webui/models/ControlNet文件夾中。可以使用ControlNet WebUI擴展加載模型，該擴展可通過WebUI的擴展選項卡安裝（https://github.com/Mikubill/sd-webui-controlnet）。

💻 使用示例

基礎用法

import torch
from PIL import Image
from diffusers import StableDiffusionControlNetImg2ImgPipeline, ControlNetModel, DDIMScheduler
from diffusers.utils import load_image

controlnet = ControlNetModel.from_pretrained("DionTimmer/controlnet_qrcode-control_v11p_sd21",
                                             torch_dtype=torch.float16)

pipe = StableDiffusionControlNetImg2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1",
    controlnet=controlnet,
    safety_checker=None,
    torch_dtype=torch.float16
)

pipe.enable_xformers_memory_efficient_attention()
pipe.scheduler = DDIMScheduler.from_config(pipe.scheduler.config)
pipe.enable_model_cpu_offload()

def resize_for_condition_image(input_image: Image, resolution: int):
    input_image = input_image.convert("RGB")
    W, H = input_image.size
    k = float(resolution) / min(H, W)
    H *= k
    W *= k
    H = int(round(H / 64.0)) * 64
    W = int(round(W / 64.0)) * 64
    img = input_image.resize((W, H), resample=Image.LANCZOS)
    return img


# play with guidance_scale, controlnet_conditioning_scale and strength to make a valid QR Code Image

# qr code image
source_image = load_image("https://s3.amazonaws.com/moonup/production/uploads/6064e095abd8d3692e3e2ed6/A_RqHaAM6YHBodPLwqtjn.png")
# initial image, anything
init_image = load_image("https://s3.amazonaws.com/moonup/production/uploads/noauth/KfMBABpOwIuNolv1pe3qX.jpeg")
condition_image = resize_for_condition_image(source_image, 768)
init_image = resize_for_condition_image(init_image, 768)
generator = torch.manual_seed(123121231)
image = pipe(prompt="a bilboard in NYC with a qrcode",
             negative_prompt="ugly, disfigured, low quality, blurry, nsfw", 
             image=init_image,
             control_image=condition_image,
             width=768,
             height=768,
             guidance_scale=20,
             controlnet_conditioning_scale=1.5,
             generator=generator,
             strength=0.9, 
             num_inference_steps=150,
            )

image.images[0]

📚 詳細文檔

性能與侷限性

這些模型在大多數情況下表現良好，但請注意，它們並非100%準確。在某些情況下，二維碼形狀可能無法按預期呈現。可以增加ControlNet的權重以強調二維碼形狀，但要注意這可能會對輸出風格產生負面影響。為了優化掃描效果，請使用糾錯模式'H'（30%）生成二維碼。

為了在風格和形狀之間取得平衡，可能需要根據具體輸入、期望輸出以及正確的提示詞對控制權重進行微調。有些提示詞在大幅增加權重之前可能不起作用。找到這些因素之間的正確平衡既是一門藝術，也是一門科學。為了獲得最佳效果，建議以768的分辨率生成藝術作品，這樣可以在最終產品中實現更高的細節水平，提高基於二維碼的藝術作品的質量和效果。