DashAnimeXL-V1开源文生图模型 - 免费生成高质量动漫图像，手部刻画更精准

首页

Dashanimexl V1

由 dashtoon 开发

DashAnimeXL V1是基于SDXL微调的文生图模型，专为生成高质量动漫图像而设计，具有增强的手部解剖结构和更好的提示理解能力。

图像生成英语#动漫风格生成 #高质量手部细节 #SDXL微调

下载量 61

发布时间 : 8/1/2024

模型简介

该模型是基于扩散模型的文生图生成模型，专注于从文本提示生成高质量的动漫风格图像。

模型特点

增强的手部解剖结构

模型在生成动漫人物时能更好地处理手部细节

改进的概念理解

对复杂提示的理解能力更强

优化的提示解析

能更准确地解析和实现文本提示中的要求

高质量动漫风格

专门针对动漫风格图像生成进行了优化

模型能力

文本到图像生成

动漫风格图像创作

高分辨率图像生成

风格化图像生成

使用案例

数字艺术创作

动漫角色设计

根据文本描述生成独特的动漫角色形象

高质量、风格一致的动漫角色图像

概念艺术创作

快速生成动漫风格的概念艺术图

可用于游戏、动画等项目的概念设计

内容创作

插画生成

为故事或文章生成配套的动漫风格插画

风格统一的系列插画作品

🚀 DashAnimeXL V1

DashAnimeXL V1 是一款基于扩散模型的文本到图像生成模型。该模型由 Dashtoon 研究团队在 SDXL 基础上微调而来，能够根据文本提示生成高质量的动漫图像。

🚀 快速开始

DashAnimeXL V1 是一款基于扩散模型的文本到图像生成模型。若要使用该模型，需先安装所需库：

pip install diffusers --upgrade
pip install transformers accelerate safetensors

以下是使用 DashAnimeXL V1 生成图像的示例代码：

import torch
from diffusers import (
    StableDiffusionXLPipeline, 
    EulerAncestralDiscreteScheduler,
    AutoencoderKL
)

# Load VAE component
vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix", 
    torch_dtype=torch.bfloat16
)

# Configure the pipeline
pipe = StableDiffusionXLPipeline.from_pretrained(
    "dashtoon/DashAnimeXL-V1", 
    vae=vae,
    torch_dtype=torch.bfloat16, 
    use_safetensors=True, 
)
pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)

if torch.cuda.is_available():
  pipe.to('cuda')

# Define prompts and generate image
prompt = "anime illustration, An ink painting with a superhot, pop art style, featuring vibrant splashes and gradient patterns merging with random signals and noise. A zoomed-in panda wearing glasses, appearing to look directly at the viewer. The piece is bathed in warm, volumetric lighting against a clear dusk sky background. The reflection in the panda's sunglasses reveals nuclear clouds, adding an element of surrealism."
negative_prompt = "nsfw, low quality, worst quality, very displeasing, 3d, watermark, signature, ugly, poorly drawn"

image = pipe(
    prompt, 
    negative_prompt=negative_prompt, 
    width=1024,
    height=1024,
    guidance_scale=7,
    num_inference_steps=20
).images[0]

✨ 主要特性

高质量动漫图像生成：DashAnimeXL V1 能够根据文本提示生成高质量的动漫图像。
增强的手部解剖结构：该模型在生成图像时，对手部解剖结构的表现更加准确。
更好的概念理解和提示解释：能够更好地理解文本提示中的概念，并生成符合要求的图像。

📦 安装指南

若要使用 DashAnimeXL V1，需安装以下库：

pip install diffusers --upgrade
pip install transformers accelerate safetensors

💻 使用示例

基础用法

import torch
from diffusers import (
    StableDiffusionXLPipeline, 
    EulerAncestralDiscreteScheduler,
    AutoencoderKL
)

# Load VAE component
vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix", 
    torch_dtype=torch.bfloat16
)

# Configure the pipeline
pipe = StableDiffusionXLPipeline.from_pretrained(
    "dashtoon/DashAnimeXL-V1", 
    vae=vae,
    torch_dtype=torch.bfloat16, 
    use_safetensors=True, 
)
pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)

if torch.cuda.is_available():
  pipe.to('cuda')

# Define prompts and generate image
prompt = "anime illustration, An ink painting with a superhot, pop art style, featuring vibrant splashes and gradient patterns merging with random signals and noise. A zoomed-in panda wearing glasses, appearing to look directly at the viewer. The piece is bathed in warm, volumetric lighting against a clear dusk sky background. The reflection in the panda's sunglasses reveals nuclear clouds, adding an element of surrealism."
negative_prompt = "nsfw, low quality, worst quality, very displeasing, 3d, watermark, signature, ugly, poorly drawn"

image = pipe(
    prompt, 
    negative_prompt=negative_prompt, 
    width=1024,
    height=1024,
    guidance_scale=7,
    num_inference_steps=20
).images[0]

📚 详细文档

模型描述

属性	详情
开发者	Dashtoon
模型类型	基于扩散模型的文本到图像生成模型
许可证	CreativeML Open RAIL++-M License
模型描述	DashAnimeXL V1 旨在根据文本提示生成高质量的动漫图像。它具有增强的手部解剖结构、更好的概念理解和提示解释能力。
总结	该模型根据文本提示生成图像，采用了与 Stable Diffusion XL 相同的架构。
微调基础模型	SDXL