sdxl - instructpix2pix - 768開源圖像編輯模型，用自然語言指令編輯圖像超方便

首頁

Sdxl Instructpix2pix 768

由diffusers開發

基於Stable Diffusion XL (SDXL)進行指令微調的圖像編輯模型，採用InstructPix2Pix方法，支持通過自然語言指令編輯圖像。

圖像生成 #圖像指令編輯 #SDXL微調 #藝術風格轉換

下載量 15.88k

發布時間 : 8/23/2023

模型概述

該模型是對Stable Diffusion XL (SDXL)進行指令微調的版本，專門用於根據文本指令編輯圖像。它能夠理解自然語言指令並相應地修改輸入圖像，如改變風格、調整內容等。

模型特點

自然語言圖像編輯

能夠理解自然語言指令並相應地修改輸入圖像

高分辨率處理

支持768x768分辨率的圖像編輯

多樣編輯能力

支持風格轉換、內容修改等多種編輯任務

模型能力

圖像編輯

風格轉換

內容修改

自然語言理解

使用案例

創意設計

藝術風格轉換

將普通照片轉換為畢加索等藝術風格

示例顯示成功將圖像轉換為畢加索風格

場景修改

修改圖像中的特定元素，如改變天空狀態

示例顯示成功將晴朗天空變為多雲

人物編輯

年齡變化

調整人物年齡表現

示例顯示成功使人物看起來更老

🚀 SDXL InstructPix2Pix (768768)

本項目基於InstructPix2Pix的方法，對Stable Diffusion XL (SDXL)進行指令微調。以下是一些生成結果示例：

編輯指令："將天空變為多雲的樣子"

編輯指令："將其變成畢加索風格的畫作"

編輯指令："讓人物看起來更老一些"

🚀 快速開始

📦 安裝指南

在使用之前，請確保安裝所需的庫：

pip install accelerate transformers
pip install git+https://github.com/huggingface/diffusers

💻 使用示例

基礎用法

import torch
from diffusers import StableDiffusionXLInstructPix2PixPipeline
from diffusers.utils import load_image

resolution = 768
image = load_image(
    "https://hf.co/datasets/diffusers/diffusers-images-docs/resolve/main/mountain.png"
).resize((resolution, resolution))
edit_instruction = "Turn sky into a cloudy one"

pipe = StableDiffusionXLInstructPix2PixPipeline.from_pretrained(
    "diffusers/sdxl-instructpix2pix-768", torch_dtype=torch.float16
).to("cuda")

edited_image = pipe(
    prompt=edit_instruction,
    image=image,
    height=resolution,
    width=resolution,
    guidance_scale=3.0,
    image_guidance_scale=1.5,
    num_inference_steps=30,
).images[0]
edited_image.save("edited_image.png")

更多詳細信息，請參考文檔。

⚠️ 重要提示

此檢查點本質上是實驗性的，有很大的改進空間。請使用本倉庫的“討論”標籤來提出問題和進行討論。

🔧 技術細節

訓練

我們使用InstructPix2Pix訓練方法對SDXL進行了15000步的微調，在768x768的圖像分辨率上使用了固定學習率5e - 6。

我們的訓練腳本和其他實用工具可以在這裡找到，它們是基於我們的官方訓練腳本構建的。

我們的訓練日誌可以在Weights and Biases上查看這裡。請參考此鏈接瞭解所有超參數的詳細信息。

訓練數據

我們使用了這個數據集：timbrooks/instructpix2pix-clip-filtered。

計算資源

使用一臺配備8張A100顯卡的機器。

批量大小

採用數據並行，單GPU批量大小為8，總批量大小為32。

混合精度

使用FP16。

📄 許可證

本項目採用OpenRAIL++許可證。

信息表格

屬性	詳情
基礎模型	stabilityai/stable-diffusion-xl-base-1.0
標籤	stable-diffusion-xl、stable-diffusion-xl-diffusers、text-to-image、diffusers、instruct-pix2pix
推理	否
數據集	timbrooks/instructpix2pix-clip-filtered
許可證	OpenRAIL++