EcomXL_controlnet_inpaint open-source text-to-image model - optimized for e-commerce scenarios, prevents foreground outpainting

Ecomxl Controlnet Inpaint

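Developed by alimama-creative

A text-to-image diffusion model optimized for e-commerce scenarios, built on Stable Diffusion XL and fine-tuned with instance masks to prevent foreground outpainting.

Image generation · English · License: Apache-2.0 · #EcommerceInpainting #InstanceMaskControl #SDXLOptimization

Downloads: 245

Released: 5/7/2024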

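Model Overview

EcomXL is a series of text-to-image diffusion models optimized for e-commerce scenarios. An inpaint ControlNet, trained specifically for e-commerce needs, conditions the diffusion model and prevents foreground outpainting.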

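Model Features

Optimized for e-commerce

Designed for e-commerce needs, with a focus on product presentation and inpainting quality.

Instance-mask fine-tuning

Fine-tuning with instance masks prevents foreground outpainting and improves inpainting precision.

High-resolution support

Supports image generation at 1024x1024, suitable for high-definition product listings.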
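
The usage example further down generates at 1024x1024 and then blends the result with the input image pixel by pixel, so the input image and its mask need to match that size. The following is a minimal sketch of one way to prepare them, assuming letterbox-style padding is acceptable for the product photo. `fit_to_square` and its parameters are hypothetical helpers for illustration; they are not part of the model or of diffusers.

```python
from PIL import Image

TARGET = 1024  # resolution used in the usage example (width=1024, height=1024)


def fit_to_square(img, size=TARGET, fill=(255, 255, 255), resample=Image.BICUBIC):
    """Scale `img` to fit inside a size x size canvas and pad the rest with `fill`.

    Hypothetical helper: `fill` must match the image mode
    (an RGB tuple for "RGB" images, a single int for "L" masks).
    """
    scale = size / max(img.width, img.height)
    new_w = max(1, round(img.width * scale))
    new_h = max(1, round(img.height * scale))
    resized = img.resize((new_w, new_h), resample)
    canvas = Image.new(img.mode, (size, size), fill)
    # Center the resized image on the canvas.
    canvas.paste(resized, ((size - new_w) // 2, (size - new_h) // 2))
    return canvas


# Example with synthetic inputs: an 800x400 photo and a matching "L" mask.
photo = Image.new("RGB", (800, 400), (10, 20, 30))
repaint_mask = Image.new("L", (800, 400), 0)  # assumption: 255 = repaint, 0 = keep

photo_sq = fit_to_square(photo)
# NEAREST keeps the mask binary; the padding is filled with 255 so it gets repainted.
mask_sq = fit_to_square(repaint_mask, fill=255, resample=Image.NEAREST)
```

The mask is padded with 255 here on the assumption that white marks the region to regenerate, which is the convention the usage example arrives at after inverting its sample mask. Use 0 instead if your mask marks the foreground.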

Model Capabilities

Text-to-image generation

Image inpainting

E-commerce product presentation

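Use Cases

E-commerce

Product image inpainting

Inpaint defects or occluded regions in product images to improve how the product is presented.

Expected result: the inpainted image preserves product detail, with no visible outpainting or distortion.

Background replacement

Replace the product background to suit different display contexts.

Expected result: the new background looks natural, and product edges stay clean without jagged artifacts.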

🚀 EcomXL Inpaint ControlNet

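EcomXL contains a series of text-to-image diffusion models optimized for e-commerce scenarios, developed on top of Stable Diffusion XL. For e-commerce scenarios, we trained an Inpaint ControlNet to control the diffusion model. Unlike inpaint ControlNets built for general scenarios, this model is fine-tuned with instance masks to prevent foreground outpainting.

✨ Key Features

Text-to-image diffusion models optimized for e-commerce scenarios.
Fine-tuned with instance masks to prevent foreground outpainting.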
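
The source does not say how an instance mask should be produced for your own product images. As an illustration only, the sketch below assumes the product is already available as a cutout with a transparent background, and derives the mask from its alpha channel. `instance_mask_from_alpha` and the `threshold` value are hypothetical, not part of the model or of any library.

```python
import numpy as np
from PIL import Image


def instance_mask_from_alpha(rgba_img, threshold=128):
    """Return an "L" mask that is 255 on the product (foreground) and 0 elsewhere.

    Hypothetical helper: assumes `rgba_img` is a product cutout whose alpha
    channel is opaque on the product and transparent on the background.
    """
    alpha = np.array(rgba_img.convert("RGBA"))[..., 3]
    mask = np.where(alpha >= threshold, 255, 0).astype(np.uint8)
    return Image.fromarray(mask)


# Synthetic 4x4 cutout: a 2x2 opaque "product" on a transparent background.
cutout = Image.new("RGBA", (4, 4), (0, 0, 0, 0))
cutout.paste((200, 50, 50, 255), (1, 1, 3, 3))

fg_mask = instance_mask_from_alpha(cutout)
# Inverting gives the region to repaint (255 = background, 0 = product).
repaint_mask = Image.fromarray(255 - np.array(fg_mask))
```

The inversion in the last line mirrors the step the usage example applies to its sample mask before building the conditioning image.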

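📦 Installation

The original documentation gives no installation steps. The usage example below imports diffusers, torch, numpy, and Pillow (PIL), so those packages need to be available.

💻 Usage Example

Basic Usage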

from diffusers import (
    ControlNetModel,
    StableDiffusionXLControlNetPipeline,
    DDPMScheduler
)
from diffusers.utils import load_image
import torch
from PIL import Image
import numpy as np

def make_inpaint_condition(init_image, mask_image):
    """Build the ControlNet conditioning tensor; pixels to repaint are set to -1."""
    init_image = np.array(init_image.convert("RGB")).astype(np.float32) / 255.0
    mask_image = np.array(mask_image.convert("L")).astype(np.float32) / 255.0
    # Image and mask must share both height and width.
    assert init_image.shape[:2] == mask_image.shape[:2], "image and mask_image must have the same image size"
    init_image[mask_image > 0.5] = -1.0  # set as masked pixel
    # HWC -> NCHW with a leading batch dimension
    init_image = np.expand_dims(init_image, 0).transpose(0, 3, 1, 2)
    init_image = torch.from_numpy(init_image)
    return init_image

def add_fg(full_img, fg_img, mask_img):
    """Paste the original foreground back: generated pixels where mask is white, original elsewhere."""
    full_img = np.array(full_img).astype(np.float32)
    fg_img = np.array(fg_img).astype(np.float32)
    mask_img = np.array(mask_img).astype(np.float32) / 255.
    full_img = full_img * mask_img + fg_img * (1-mask_img)
    return Image.fromarray(np.clip(full_img, 0, 255).astype(np.uint8))

controlnet = ControlNetModel.from_pretrained(
    "alimama-creative/EcomXL_controlnet_inpaint",
    use_safetensors=True,
)

pipe = StableDiffusionXLControlNetPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", 
    controlnet=controlnet, 
)
pipe.to("cuda")
pipe.scheduler = DDPMScheduler.from_config(pipe.scheduler.config)

image = load_image(
    "https://huggingface.co/alimama-creative/EcomXL_controlnet_inpaint/resolve/main/images/inp_0.png"
)
mask = load_image(
    "https://huggingface.co/alimama-creative/EcomXL_controlnet_inpaint/resolve/main/images/inp_1.png"
)
# Invert the mask: afterwards white (255) marks pixels to regenerate, black (0) pixels to keep.
mask = Image.fromarray(255 - np.array(mask))

control_image = make_inpaint_condition(image, mask)

prompt = "a product on the table"

generator = torch.Generator(device="cuda").manual_seed(1234)

res_image = pipe(
    prompt,
    image=control_image,
    num_inference_steps=25,
    guidance_scale=7,
    width=1024,
    height=1024,
    controlnet_conditioning_scale=0.5,
    generator=generator,
).images[0]

res_image = add_fg(res_image, image, mask)
res_image.save("res.png")
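
To make the mask convention in the example concrete, here is a small NumPy-only walk-through on a 2x2 image. It is a sketch rather than the authors' code: it reproduces the arithmetic of `make_inpaint_condition` and `add_fg` without torch or PIL, and a flat grey array stands in for the pipeline output.

```python
import numpy as np

# 2x2 RGB image (values 0..255) and a mask where 255 = repaint, 0 = keep.
image = np.array([[[255, 0, 0], [0, 255, 0]],
                  [[0, 0, 255], [255, 255, 255]]], dtype=np.uint8)
mask = np.array([[255, 0],
                 [0, 255]], dtype=np.uint8)

# Conditioning image: scale to [0, 1], then flag the repaint pixels with -1.0.
cond = image.astype(np.float32) / 255.0
cond[mask.astype(np.float32) / 255.0 > 0.5] = -1.0
cond = cond[None].transpose(0, 3, 1, 2)  # HWC -> NCHW, shape (1, 3, 2, 2)

# Stand-in for the pipeline output: a flat grey "generated" image.
generated = np.full_like(image, 128)

# Compositing as in add_fg: generated pixels where the mask is 1, original elsewhere.
m = (mask.astype(np.float32) / 255.0)[..., None]
result = generated.astype(np.float32) * m + image.astype(np.float32) * (1 - m)
result = np.clip(result, 0, 255).astype(np.uint8)

print(cond[0, :, 0, 0])  # masked pixel: all channels are -1
print(result[0, 1])      # kept pixel: original green survives
```

Because the original pixels are pasted back wherever the mask is black, the product itself is never altered by the diffusion model in the final image; only the masked region comes from the generated output.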