DashAnimeXL-V1 open-source text-to-image model - free, high-quality anime image generation with more accurate hands

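DashAnimeXL V1

Developed by Dashtoon

DashAnimeXL V1 is a text-to-image model fine-tuned from SDXL. It is designed to generate high-quality anime images, with improved hand anatomy and better prompt understanding.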

Image generation | English | #anime-style generation #high-quality hand details #SDXL fine-tuning

Downloads: 61

Released: 8/1/2024

Model overview

This is a diffusion-based text-to-image model focused on generating high-quality anime-style images from text prompts.

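Model features

Improved hand anatomy

Handles hand details better when generating anime characters

Improved concept understanding

Understands complex prompts more reliably

Optimized prompt parsing

Parses and follows the requirements stated in a text prompt more accurately

High-quality anime style

Optimized specifically for anime-style image generation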

Model capabilities

Text-to-image generation

Anime-style image creation

High-resolution image generation

Stylized image generation
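
The usage example on this page only shows a 1024x1024 render. For other aspect ratios, one possible way to pick a size is sketched below. The helper name `sdxl_size`, the roughly one-megapixel budget, and the snap to multiples of 64 are assumptions based on common SDXL practice, not settings documented for DashAnimeXL V1. The diffusers SDXL pipeline does require width and height to be divisible by 8, which any multiple of 64 satisfies.

```python
def sdxl_size(aspect_ratio, target_pixels=1024 * 1024, multiple=64):
    """Return (width, height) close to `target_pixels` for a width/height ratio.

    Hypothetical helper: the pixel budget and the rounding step are
    assumptions, not values published for DashAnimeXL V1.
    """
    if aspect_ratio <= 0:
        raise ValueError("aspect_ratio must be positive")

    # Solve width * height = target_pixels with width = aspect_ratio * height.
    height = (target_pixels / aspect_ratio) ** 0.5
    width = height * aspect_ratio

    def snap(value):
        # Round to the nearest multiple, never going below one step.
        return max(multiple, int(round(value / multiple)) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    print(sdxl_size(1.0))
    print(sdxl_size(16 / 9))
```

The returned pair can be passed as `width` and `height` to the pipeline call in the quick start.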

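Use cases

Digital art creation

Anime character design

Generate distinctive anime characters from text descriptions

High-quality, stylistically consistent character images

Concept art creation

Quickly generate anime-style concept art

Suitable for concept design in games, animation, and similar projects

Content creation

Illustration generation

Generate anime-style illustrations to accompany stories or articles

Illustration series with a consistent style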

🚀 DashAnimeXL V1

DashAnimeXL V1 is a diffusion-based text-to-image model. The Dashtoon research team fine-tuned it from SDXL to generate high-quality anime images from text prompts.

🚀 Quick start

To use the model, first install the required libraries:

pip install diffusers --upgrade
pip install transformers accelerate safetensors
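
Before loading the model, it can help to confirm that the packages from the commands above are importable. The following is a minimal sketch using only the standard library; the function name `missing_packages` is illustrative. The usage code also imports `torch`, which the pip commands above do not list, so it is checked separately here.

```python
from importlib.util import find_spec

# Packages named in the pip commands above.
REQUIRED = ["diffusers", "transformers", "accelerate", "safetensors"]


def missing_packages(names):
    """Return the entries of `names` that cannot be located for import."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # torch is imported by the usage example but not listed in the pip commands.
    missing = missing_packages(REQUIRED + ["torch"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```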

The following example generates an image with DashAnimeXL V1:

import torch
from diffusers import (
    StableDiffusionXLPipeline,
    EulerAncestralDiscreteScheduler,
    AutoencoderKL,
)

# Load the VAE component
vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix",
    torch_dtype=torch.bfloat16,
)

# Configure the pipeline with the fine-tuned checkpoint and the VAE above
pipe = StableDiffusionXLPipeline.from_pretrained(
    "dashtoon/DashAnimeXL-V1",
    vae=vae,
    torch_dtype=torch.bfloat16,
    use_safetensors=True,
)
# Use the Euler Ancestral sampler, reusing the existing scheduler config
pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)

# Move the pipeline to the GPU when one is available
if torch.cuda.is_available():
    pipe.to("cuda")

# Define prompts and generate image
prompt = "anime illustration, An ink painting with a superhot, pop art style, featuring vibrant splashes and gradient patterns merging with random signals and noise. A zoomed-in panda wearing glasses, appearing to look directly at the viewer. The piece is bathed in warm, volumetric lighting against a clear dusk sky background. The reflection in the panda's sunglasses reveals nuclear clouds, adding an element of surrealism."
negative_prompt = "nsfw, low quality, worst quality, very displeasing, 3d, watermark, signature, ugly, poorly drawn"

image = pipe(
    prompt,
    negative_prompt=negative_prompt,
    width=1024,
    height=1024,
    guidance_scale=7,
    num_inference_steps=20,
).images[0]

# Save the result to disk (the file name is arbitrary)
image.save("output.png")
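
When trying several variations, it may be convenient to keep the sampling parameters from the example in one place. The sketch below is one way to do that. `GenerationSettings` is a hypothetical helper, not part of diffusers. Its defaults mirror the values in the example above, and its field names match the keyword arguments already passed to the pipeline there.

```python
from dataclasses import asdict, dataclass, replace


@dataclass(frozen=True)
class GenerationSettings:
    """Hypothetical container for the sampling values used in the example."""

    width: int = 1024
    height: int = 1024
    guidance_scale: float = 7.0
    num_inference_steps: int = 20

    def to_kwargs(self):
        """Return the settings as keyword arguments for the pipeline call."""
        # The diffusers SDXL pipeline rejects sizes not divisible by 8.
        if self.width % 8 or self.height % 8:
            raise ValueError("width and height must be divisible by 8")
        return asdict(self)


defaults = GenerationSettings()
# Derive a variant without mutating the defaults.
more_steps = replace(defaults, num_inference_steps=30)
```

With the pipeline from the example loaded, the call becomes `pipe(prompt, negative_prompt=negative_prompt, **more_steps.to_kwargs()).images[0]`.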

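✨ Key features

High-quality anime image generation: DashAnimeXL V1 generates high-quality anime images from text prompts.
Improved hand anatomy: the model renders hand anatomy more accurately.
Better concept understanding and prompt interpretation: the model understands the concepts in a text prompt better and produces images that match the request.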
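
The example prompt on this page starts with the phrase "anime illustration" and pairs it with a comma-separated list of negative tags. The sketch below assembles prompts in that shape. Both function names are illustrative, and the source does not say whether the model requires the prefix; it is only what the example uses. The default negative tags are copied from that example.

```python
# Negative tags copied from the usage example on this page.
DEFAULT_NEGATIVE_TAGS = (
    "nsfw", "low quality", "worst quality", "very displeasing",
    "3d", "watermark", "signature", "ugly", "poorly drawn",
)


def build_prompt(description, style_prefix="anime illustration"):
    """Prefix a free-form description with a style phrase (assumed convention)."""
    description = description.strip()
    if not description:
        raise ValueError("description must not be empty")
    return f"{style_prefix}, {description}"


def build_negative_prompt(extra_tags=(), base_tags=DEFAULT_NEGATIVE_TAGS):
    """Join base and extra tags with commas, dropping blanks and duplicates."""
    tags = []
    for tag in (*base_tags, *extra_tags):
        tag = tag.strip()
        if tag and tag not in tags:
            tags.append(tag)
    return ", ".join(tags)


if __name__ == "__main__":
    print(build_prompt("a panda wearing sunglasses at dusk"))
    print(build_negative_prompt(["blurry"]))
```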

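📚 Detailed documentation

Model description

Property	Details
Developer	Dashtoon
Model type	Diffusion-based text-to-image generation model
License	CreativeML Open RAIL++-M License
Model description	DashAnimeXL V1 is designed to generate high-quality anime images from text prompts. It offers improved hand anatomy, better concept understanding, and better prompt interpretation.
Summary	The model generates images from text prompts and uses the same architecture as Stable Diffusion XL.
Fine-tuned from	SDXL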