Pixel-art-xl開源模型 - 基於Stable Diffusion XL生成高質量像素風格圖像

首頁

Pixel Art Xl

由nerijs開發

基於Stable Diffusion XL的像素藝術風格LoRA模型，可生成高質量的像素風格圖像

圖像生成開源協議:Openrail #像素風格生成 #LoRA加速優化 #8步快速出圖

下載量 6,667

發布時間 : 8/3/2023

模型概述

該模型是Stable Diffusion XL的LoRA適配器，專門用於生成各種像素藝術風格的圖像，支持等軸測與非等軸測風格，無需觸發詞即可使用

模型特點

高質量像素藝術

專門優化用於生成清晰、風格化的像素藝術作品

兼容LCM加速

可搭配LCM LoRA實現快速生成(僅需8步迭代)

無需觸發詞

不需要特定觸發詞即可生成像素風格圖像

多風格支持

同時支持等軸測和非等軸測像素風格

模型能力

像素風格圖像生成

風格化圖像轉換

快速圖像生成(配合LCM)

使用案例

遊戲開發

遊戲角色設計

生成像素風格遊戲角色素材

可直接用於2D遊戲的風格化角色

場景元素設計

創建像素風格的遊戲場景元素

統一的像素美術風格資源

數字藝術

像素藝術創作

快速生成像素風格藝術作品

具有復古風格的數字藝術作品

🚀 Pixel Art XL

Pixel Art XL 是一款基於 Stable Diffusion XL 的 LoRA 模型，專注於生成像素風圖像。它能夠將文本描述轉化為精美的像素藝術作品，為用戶帶來獨特的圖像生成體驗。

🚀 快速開始

環境準備

使用該模型前，你需要安裝必要的庫，以下是示例代碼：

from diffusers import DiffusionPipeline, LCMScheduler
import torch

模型加載

model_id = "stabilityai/stable-diffusion-xl-base-1.0"
lcm_lora_id = "latent-consistency/lcm-lora-sdxl"
pipe = DiffusionPipeline.from_pretrained(model_id, variant="fp16")
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)

pipe.load_lora_weights(lcm_lora_id, adapter_name="lora")
pipe.load_lora_weights("./pixel-art-xl.safetensors", adapter_name="pixel")

pipe.set_adapters(["lora", "pixel"], adapter_weights=[1.0, 1.2])
pipe.to(device="cuda", dtype=torch.float16)

圖像生成

prompt = "pixel, a cute corgi"
negative_prompt = "3d render, realistic"

num_images = 9

for i in range(num_images):
    img = pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        num_inference_steps=8,
        guidance_scale=1.5,
    ).images[0]
    
    img.save(f"lcm_lora_{i}.png")

✨ 主要特性

像素完美：通過將圖像下采樣 8 次（使用最近鄰插值法），可以獲得像素完美的圖像。
減少偽影：使用固定的 VAE（如 0.9 或 fp16 修復）可以避免生成圖像出現偽影。
高性能：搭配 LCM LoRA 使用，僅需 8 步推理和 1.5 的引導比例，即可快速生成高質量圖像。
靈活性高：無需使用 refiner，僅使用 1 個文本編碼器即可工作，無需風格提示和觸發關鍵詞。

💻 使用示例

基礎用法

from diffusers import DiffusionPipeline, LCMScheduler
import torch

model_id = "stabilityai/stable-diffusion-xl-base-1.0"
lcm_lora_id = "latent-consistency/lcm-lora-sdxl"
pipe = DiffusionPipeline.from_pretrained(model_id, variant="fp16")
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)

pipe.load_lora_weights(lcm_lora_id, adapter_name="lora")
pipe.load_lora_weights("./pixel-art-xl.safetensors", adapter_name="pixel")

pipe.set_adapters(["lora", "pixel"], adapter_weights=[1.0, 1.2])
pipe.to(device="cuda", dtype=torch.float16)

prompt = "pixel, a cute corgi"
negative_prompt = "3d render, realistic"

num_images = 9

for i in range(num_images):
    img = pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        num_inference_steps=8,
        guidance_scale=1.5,
    ).images[0]
    
    img.save(f"lcm_lora_{i}.png")

高級用法

如果你需要更高的性能，可以使用 LCM LoRA 並調整參數：

# 使用 LCM LoRA 提高性能
from diffusers import DiffusionPipeline, LCMScheduler
import torch

model_id = "stabilityai/stable-diffusion-xl-base-1.0"
lcm_lora_id = "latent-consistency/lcm-lora-sdxl"
pipe = DiffusionPipeline.from_pretrained(model_id, variant="fp16")
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)

pipe.load_lora_weights(lcm_lora_id, adapter_name="lora")
pipe.load_lora_weights("./pixel-art-xl.safetensors", adapter_name="pixel")

pipe.set_adapters(["lora", "pixel"], adapter_weights=[1.0, 1.2])
pipe.to(device="cuda", dtype=torch.float16)

prompt = "pixel, a cute corgi"
negative_prompt = "3d render, realistic"

# 僅需 8 步推理和 1.5 的引導比例
num_inference_steps = 8
guidance_scale = 1.5

num_images = 9

for i in range(num_images):
    img = pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        num_inference_steps=num_inference_steps,
        guidance_scale=guidance_scale,
    ).images[0]
    
    img.save(f"lcm_lora_{i}.png")