picasso-diffusion-1-1: Open-Source Image Generation AI - Create High-Quality AI Art for Free

Picasso Diffusion 1.1

Developed by aipicasso

An image generation AI focused on AI art, developed with about 7,000 GPU hours and based on the Stable Diffusion architecture.

Image Generation Open Source License:Other #Japanese anime generation #Non-commercial art creation #High-precision portraits

Downloads 28

Release Time : 2/17/2023

Model Overview

A diffusion-based text-to-image generation model focused on artistic creation, with support for Japanese prompts.

Model Features

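Art Style Generation

Focused on artistic creation; can generate high-quality anime-style images.

Japanese Optimization

Specifically optimized for Japanese prompts.

Non-Commercial Use

The license prohibits commercial use in order to protect the creative industry.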

Model Capabilities

Text-to-image generation

Art style transfer

High-quality image synthesis

Use Cases

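Art Creation

Anime Character Creation

Generating original anime-style character portraits

High-quality 4K-resolution images

Educational Use

Graduation projects by art school students

News Reporting

AI Art Coverage

Media coverage of developments in image generation AI technology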

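🚀 Picasso Diffusion 1.1 Model Card

Picasso Diffusion 1.1 is an image generation AI built specifically for AI art. It was developed with about 7,000 GPU hours and generates images that match a given prompt.

Title: Welcome to the World of Scientific Fact.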

The English version is available here.

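🚀 Quick Start

To try the model easily, use this Space. You can also download the model in safetensors format or ckpt format.

✨ Key Features

Picasso Diffusion is an image generation AI specialized for AI art and developed with about 7,000 GPU hours. It uses a Latent Diffusion Model together with OpenCLIP-ViT/H to generate images that match a prompt.

📦 Installation Guide

Web UI

As with Stable Diffusion v2, place the model file (ckpt or safetensors format) and the yaml configuration file in the models folder. For detailed installation steps, refer to this article. Installing xformers and enabling the --xformers --disable-nan-check options is recommended; if xformers is not installed, enable the --no-half option instead.

Diffusers

First, run the following command to install the libraries: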

pip install --upgrade git+https://github.com/huggingface/diffusers.git transformers accelerate scipy

Then run the following script to generate an image:

from diffusers import StableDiffusionPipeline, EulerAncestralDiscreteScheduler
import torch

model_id = "alfredplpl/picasso-diffusion-1-1"

# Load the scheduler configuration stored in the model repository.
scheduler = EulerAncestralDiscreteScheduler.from_pretrained(model_id, subfolder="scheduler")
# Load the pipeline in half precision and move it to the GPU.
pipe = StableDiffusionPipeline.from_pretrained(model_id, scheduler=scheduler, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

prompt = "anime, masterpiece, a portrait of a girl, good pupil, 4k, detailed"
negative_prompt="deformed, blurry, bad anatomy, bad pupil, disfigured, poorly drawn face, mutation, mutated, extra limb, ugly, poorly drawn hands, bad hands, fused fingers, messy drawing, broken legs censor, low quality, mutated hands and fingers, long body, mutation, poorly drawn, bad eyes, ui, error, missing fingers, fused fingers, one hand with more than 5 fingers, one hand with less than 5 fingers, one hand with more than 5 digit, one hand with less than 5 digit, extra digit, fewer digits, fused digit, missing digit, bad digit, liquid digit, long body, uncoordinated body, unnatural body, lowres, jpeg artifacts, 3d, cg, text, japanese kanji"
# Generate with 20 inference steps and save the first image.
images = pipe(prompt, negative_prompt=negative_prompt, num_inference_steps=20).images
images[0].save("girl.png")

Notes:

Using xformers can speed up generation.
If your GPU has limited VRAM, call pipe.enable_attention_slicing().
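
The two notes above could be applied in code roughly as follows. This is a minimal sketch, not part of the original model card: the helper name `apply_memory_options` and its flags are hypothetical, while `enable_xformers_memory_efficient_attention()` and `enable_attention_slicing()` are existing Diffusers pipeline methods. It assumes `pipe` is the pipeline created in the script above and that the xformers package is installed whenever `use_xformers` is set.

```python
def apply_memory_options(pipe, use_xformers=False, low_vram=False):
    """Apply optional speed/memory settings to a Diffusers pipeline.

    `pipe` is expected to be a pipeline such as StableDiffusionPipeline.
    The helper name and its flags are illustrative, not part of Diffusers.
    """
    applied = []
    if use_xformers:
        # Requires the xformers package to be installed.
        pipe.enable_xformers_memory_efficient_attention()
        applied.append("xformers")
    if low_vram:
        # Computes attention in slices, trading some speed for lower VRAM use.
        pipe.enable_attention_slicing()
        applied.append("attention_slicing")
    return applied
```

For example, calling `apply_memory_options(pipe, low_vram=True)` before `pipe(...)` would enable attention slicing on a GPU with little memory.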

💻 Usage Examples

Basic Usage

The basic example for generating an image with the Diffusers library is the same script shown in the Diffusers part of the installation guide above: it loads the model with an EulerAncestralDiscreteScheduler, runs 20 inference steps, and saves the result as girl.png.

Advanced Usage

No advanced usage examples are available yet.

📚 Detailed Documentation

Model Details

Property	Details
Model type	Diffusion-based text-to-image generation model
Language	Japanese
License	CreativeML Open RAIL++-M-NC License
Description	This model generates images that match a prompt. It uses a Latent Diffusion Model together with OpenCLIP-ViT/H.
Reference	@InProceedings{Rombach_2022_CVPR, author = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj{\"o}rn}, title = {High-Resolution Image Synthesis With Latent Diffusion Models}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2022}, pages = {10684-10695} }

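Intended Uses

Self-expression: use this AI to express your own individuality.
Reporting on image generation AI: usable by both public broadcasters and for-profit companies.
Research and development:
- Using the model on Discord, for example prompt engineering, fine-tuning (also called additional training, such as DreamBooth), and merging with other models.
- Evaluating model performance with metrics such as FID.
- Checking with checksums or hash functions that this model is independent of models other than Stable Diffusion.
Education: graduation projects by art university and vocational school students, graduation theses and coursework by university students, and teachers explaining the current state of image generation AI.
Hugging Face Community: please ask questions in Japanese or English.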
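
The checksum item in the research and development list above can be illustrated with a short sketch. It computes a SHA-256 digest of a model file with Python's standard `hashlib` module. The function names `sha256_of_file` and `same_file_contents` are hypothetical, and the model card does not say which hash function or reference values should be used, so SHA-256 is only an assumption here.

```python
import hashlib


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns empty bytes at end of file.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_file_contents(path_a, path_b):
    """Return True if the two files have identical SHA-256 digests."""
    return sha256_of_file(path_a) == sha256_of_file(path_b)
```

Matching digests indicate that two checkpoint files are byte-identical. Differing digests only show that the files differ; they say nothing about how the underlying weights are related.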