PVC - V3開源AI模型 - 免費生成超逼真PVC手辦風格圖像

首頁

Pvc V3

由p1atdev開發

基於Waifu Diffusion v1.5 beta 2微調的潛在擴散模型，專用於生成PVC手辦風格的圖像

圖像生成英語開源協議:其他 #手辦風格生成 #動漫PVC材質 #可動關節模型

下載量 47

發布時間 : 3/1/2023

模型概述

該模型專門用於生成PVC手辦風格的動漫角色圖像，支持使用Danbooru標籤體系進行圖像生成，可模擬不同手辦類型如figma可動模型、nendoroid粘土人等風格

模型特點

多種手辦風格支持

可通過觸發詞(pvc/figma/nendoroid)生成不同類型的手辦風格圖像

動漫風格優化

特別優化了動漫風格表現，避免過於真實的渲染效果

Danbooru標籤兼容

支持使用Danbooru標籤體系進行精確的圖像生成控制

模型能力

動漫風格圖像生成

PVC手辦風格模擬

可動模型風格生成

粘土人風格生成

使用案例

數字藝術創作

手辦概念設計

為手辦設計師提供快速的概念可視化方案

生成具有商業手辦風格的概念圖像

動漫內容創作

角色周邊設計

為動漫角色生成周邊產品(如手辦)的預覽效果

生成可用於宣傳的高質量手辦風格圖像

🚀 PVC v3

PVC v3 是一個基於潛在擴散模型的圖像生成模型，它在 Waifu Diffusion v1.5 beta 2 的基礎上，使用 PVC 手辦圖像進行了微調。用戶可以使用 Danbooru 標籤來生成圖像。

🚀 快速開始

使用 🤗 的 Diffusers 庫可以簡單高效地運行 Stable Diffusion 2。

pip install diffusers transformers accelerate scipy safetensors
pip install --pre xformers

使用 StableDiffusionPipeline：

import torch
from diffusers import StableDiffusionPipeline

model_id = "p1atdev/pvc-v3"
revision = "fp16" # "main" or "fp16"

pipe = StableDiffusionPipeline.from_pretrained(
    model_id, 
    revision=revision, 
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")
pipe.enable_attention_slicing()
pipe.enable_xformers_memory_efficient_attention() # required

prompt = "pvc, masterpiece, best quality, exceptional, 1girl, cat ears, red hair, long hair, hairpin, swept bangs, yellow eyes, black jacket, white shirt, blue tie, white gloves, hand up, upper body, looking at viewer, buildings"
negative_prompt = "nsfw, nude, worst quality, low quality, oldest, bad anatomy"
image = pipe(
    prompt, 
    negative_prompt=negative_prompt,
    guidance_scale=7.0,
    num_inference_steps=20
).images[0]

# save image
image.save("pvc_figure.png")

# or just display it
# display(image)

使用 StableDiffusionLongPromptWeightingPipeline：

import torch
from diffusers import DiffusionPipeline

model_id = "p1atdev/pvc-v3"
revision = "fp16" # "main" or "fp16"

pipe = DiffusionPipeline.from_pretrained(
    model_id, 
    revision=revision, 
    torch_dtype=torch.float16,
    custom_pipeline="lpw_stable_diffusion"
)
pipe = pipe.to("cuda")
pipe.enable_attention_slicing()
pipe.enable_xformers_memory_efficient_attention() # required

prompt = """
pvc, anime, masterpiece, best quality, exceptional,
1girl, bangs, bare shoulders, beret, black hair, black shorts, blue hair, bracelet, breasts, buttons,
colored inner hair, double-breasted, eyewear removed, green headwear, green jacket, grey eyes, grey sky,
hat, jacket, jewelry, long hair, looking at viewer, multicolored hair, neck ring, o-ring, off shoulder, rain,
round eyewear, shorts, sidelocks, small breasts, solo, sunglasses, wavy hair, wet, zipper
""" # long prompt

negative_prompt = "nsfw, nude, worst quality, low quality, oldest, bad anatomy"
image = pipe(
    prompt, 
    negative_prompt=negative_prompt,
    guidance_scale=7.0,
    num_inference_steps=20
).images[0]

display(image)

✨ 主要特性

基於 Waifu Diffusion v1.5 beta 2 微調，生成的圖像具有 PVC 手辦風格。
支持使用 Danbooru 標籤進行圖像生成。
提供多種模型版本供用戶選擇。

📦 安裝指南

pip install diffusers transformers accelerate scipy safetensors
pip install --pre xformers

💻 使用示例

基礎用法

import torch
from diffusers import StableDiffusionPipeline

model_id = "p1atdev/pvc-v3"
revision = "fp16" # "main" or "fp16"

pipe = StableDiffusionPipeline.from_pretrained(
    model_id, 
    revision=revision, 
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")
pipe.enable_attention_slicing()
pipe.enable_xformers_memory_efficient_attention() # required

prompt = "pvc, masterpiece, best quality, exceptional, 1girl, cat ears, red hair, long hair, hairpin, swept bangs, yellow eyes, black jacket, white shirt, blue tie, white gloves, hand up, upper body, looking at viewer, buildings"
negative_prompt = "nsfw, nude, worst quality, low quality, oldest, bad anatomy"
image = pipe(
    prompt, 
    negative_prompt=negative_prompt,
    guidance_scale=7.0,
    num_inference_steps=20
).images[0]

# save image
image.save("pvc_figure.png")

# or just display it
# display(image)

高級用法

import torch
from diffusers import DiffusionPipeline

model_id = "p1atdev/pvc-v3"
revision = "fp16" # "main" or "fp16"

pipe = DiffusionPipeline.from_pretrained(
    model_id, 
    revision=revision, 
    torch_dtype=torch.float16,
    custom_pipeline="lpw_stable_diffusion"
)
pipe = pipe.to("cuda")
pipe.enable_attention_slicing()
pipe.enable_xformers_memory_efficient_attention() # required

prompt = """
pvc, anime, masterpiece, best quality, exceptional,
1girl, bangs, bare shoulders, beret, black hair, black shorts, blue hair, bracelet, breasts, buttons,
colored inner hair, double-breasted, eyewear removed, green headwear, green jacket, grey eyes, grey sky,
hat, jacket, jewelry, long hair, looking at viewer, multicolored hair, neck ring, o-ring, off shoulder, rain,
round eyewear, shorts, sidelocks, small breasts, solo, sunglasses, wavy hair, wet, zipper
""" # long prompt

negative_prompt = "nsfw, nude, worst quality, low quality, oldest, bad anatomy"
image = pipe(
    prompt, 
    negative_prompt=negative_prompt,
    guidance_scale=7.0,
    num_inference_steps=20
).images[0]

display(image)

📚 詳細文檔

下載

文件名	大小	鏈接
pvc-v3-fp16.safetensors	2.58 GB	點擊下載
pvc-v3-fp16.ckpt	2.58 GB	點擊下載
pvc-v3-fp32.safetensors	5.16 GB	點擊下載
pvc-v3-fp32.ckpt	5.16 GB	點擊下載

請使用 WD 的 vae 以獲得更好的效果！此外，你可以在負向提示詞中使用 badquality embedding！

提示詞指南

觸發詞

pvc 表示 PVC 材質風格，但並非總是必需。
figma 是具有關節的手辦風格，更傾向於產品縮略圖。與 doll joints 一起使用可獲得更好的關節效果。
nendoroid 表示黏土人風格。與 chibi 一起使用可獲得更好的效果。

提示

PVC 手辦風格更接近動漫風格，而非寫實風格。因此，有時建議在正向提示詞中加入 anime，或在負向提示詞中加入 realistic，以獲得更好的效果。如果你想避免生成過於寫實的面部，可以嘗試這種方法！

示例

這裡展示了一些使用該模型生成的圖像示例，以及對應的提示詞和參數設置。示例圖片1

正向提示詞：masterpiece, best quality, pvc, 1girl, cat ears, blue hair, gradient hair, colored inner hair, long hair, floating hair, blue eyes, school uniform, blue shirt, ribbon, short skirt, thighhighs, zettai ryouiki, school bag, from above, cowboy shot, looking at viewer, wind, street, day 負向提示詞：badquality, oldest, chibi 步數：28 採樣器：DPM++ SDE Karras CFG 比例：10 種子：744670484 尺寸：576x768 模型哈希值：0866b17d46 模型：pvc-v3-fp16 去噪強度：0.6 Clip 跳過：2 高分辨率放大：1.5 高分辨率上採樣器：Latent

（其他示例圖片及對應信息同理展示，此處省略）

訓練信息

參數	值
服務	Runpod
GPU	A5000
筆記本	Linaqruf/kohya-trainer
成本	約 2 美元
時長	約 6 小時
數據集	來自 p1atdev/pvc 的 7467 張圖像
分辨率	896
輪數	5
優化器	Lion
學習率	4e-7
調度器	cosine_with_restarts
訓練批次大小	1