🚀 PVC Style LoRA for AnimagineXL 4.0 opt
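This project is a PVC-style LoRA for AnimagineXL 4.0. It generates images that look like PVC figures.
🚀 Quick Start
This model has only been tested in ComfyUI. To generate images with it, follow the usage examples below.
✨ Key Features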
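- Trigger words: use `pvc`, `nendoroid`, or `figma` as a trigger word to generate images in the corresponding style.
- Format: the model weights are provided in Safetensors format.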
📦 Installation
The model weights can be downloaded from the Files & versions tab.
💻 Usage Examples
Basic usage
In ComfyUI, enter the following trigger words to generate an image:
pvc, nendoroid, figma
Advanced usage
Combine a trigger word with more descriptive tags to generate richer images, for example:
1girl, solo, blue background, halftone background, looking at viewer, animal ears, school uniform, yellow eyes, black hair, long hair, sitting, crossed legs, cat ears, border, halftone, white border, couch, open mouth, shirt, skirt, long sleeves, white shirt, black skirt, pleated skirt, serafuku, neckerchief, black sailor collar, socks, white socks, outside border, sailor collar, animal ear fluff, black neckerchief, masterpiece, best quality, high score, great score, latest, figma
A negative prompt can also be set to avoid undesirable results:
lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, fewer digits, cropped, worst quality, low quality, low score, bad score, average score, signature, watermark, username, blurry
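The example prompts follow a simple pattern: content tags, then quality tags, then one trigger word. A minimal Python sketch of that assembly is shown here. The helper name `build_prompt` and the default quality-tag list are illustrative assumptions taken from the example prompt; they are not part of the model or of ComfyUI.

```python
# Hypothetical helper: assembles a comma-separated tag prompt.
TRIGGER_WORDS = ("pvc", "nendoroid", "figma")  # trigger words listed in this README
QUALITY_TAGS = ("masterpiece", "best quality", "high score", "great score", "latest")


def build_prompt(content_tags, trigger="figma", quality_tags=QUALITY_TAGS):
    """Join content tags, quality tags, and one trigger word into a prompt string."""
    if trigger not in TRIGGER_WORDS:
        raise ValueError(f"unknown trigger word: {trigger!r}")
    seen = set()
    ordered = []
    # Keep the first occurrence of each tag and preserve the original order.
    for tag in [*content_tags, *quality_tags, trigger]:
        tag = tag.strip()
        if tag and tag not in seen:
            seen.add(tag)
            ordered.append(tag)
    return ", ".join(ordered)


prompt = build_prompt(["1girl", "solo", "cat ears", "school uniform"], trigger="figma")
print(prompt)
```

The resulting string can be pasted into the positive prompt node in ComfyUI; the negative prompt is used unchanged.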
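📚 Documentation
Dataset
About 7,000 images of PVC figures were used as the training dataset.
Training
Training code
The training code is available at: https://github.com/p1atdev/vision-ft
Training settings
The training settings are detailed in the following configuration files: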
`config.yml`
model:
  checkpoint_path: "./models/animagine-xl-4.0-opt.bnb_nf4.safetensors"
  pretrained_model_name_or_path: cagliostrolab/animagine-xl-4.0
  dtype: bfloat16
  denoiser:
    attn_implementation: "flash_attention_2"
peft:
  config:
    type: lora
    rank: 2
    alpha: 1.0
    dropout: 0.0
    dtype: bfloat16
  include_keys:
    - "attn1"
    - "attn2"
    - ".ff."
  exclude_keys: ["text_encoder", "vae"]
dataset:
  folder: "data/pvc"
  num_repeats: 4
  batch_size: 4
  bucket_base_size: 1024
  step: 128
  min_size: 384
  do_upscale: true
  caption_processors:
    - type: shuffle
      split_separator: ","
optimizer:
  name: "schedulefree.RAdamScheduleFree"
  args:
    lr: 0.005
scheduler:
tracker:
  project_name: "sdxl-pvc-1"
  loggers:
    - wandb
saving:
  strategy:
    per_epochs: 0.25
    per_steps: null
    save_last: true
  callbacks:
    - type: "safentensors"
      name: "sdxl-pvc"
      save_dir: "./output/sdxl-pvc-2"
preview:
  strategy:
    per_epochs: 1
    per_steps: 100
  callbacks:
    - type: "local"
      save_dir: "./output/sdxl-lora/preview"
  data:
    path: "./projects/pvc-sdxl/preview.yml"
seed: 42
num_train_epochs: 10
trainer:
  gradient_checkpointing: true
  torch_compile: true
  torch_compile_args:
    backend: eager
    mode: default
    fullgraph: false
  fp32_matmul_precision: "high"
  allow_tf32: true
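For orientation, the `peft` settings in `config.yml` use `rank: 2` and `alpha: 1.0`. In the common LoRA formulation the low-rank update is scaled by alpha / rank, which would be 0.5 here. Whether vision-ft applies exactly this scaling is an assumption. The sketch below uses plain Python lists and made-up toy weights to show the computation.

```python
# Sketch of the common LoRA update: y = W x + (alpha / rank) * B (A x).
# Assumption: the trainer scales by alpha / rank, as most LoRA implementations do.
RANK = 2      # peft.config.rank from config.yml
ALPHA = 1.0   # peft.config.alpha from config.yml


def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def lora_forward(weight, lora_a, lora_b, x, alpha=ALPHA, rank=RANK):
    """Return the frozen layer output plus the scaled low-rank correction."""
    scale = alpha / rank
    base = matvec(weight, x)                    # frozen weight path
    delta = matvec(lora_b, matvec(lora_a, x))   # down-project, then up-project
    return [b + scale * d for b, d in zip(base, delta)]


# Toy 3x3 layer with rank-2 adapters (values are illustrative only).
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]   # rank x in_features
B = [[2.0, 0.0], [0.0, 2.0], [0.0, 0.0]]  # out_features x rank

scale = ALPHA / RANK
output = lora_forward(W, A, B, [1.0, 2.0, 3.0])
print(scale, output)
```

With a rank this small, the adapter adds very few trainable parameters per layer, and `include_keys` limits it to the attention (`attn1`, `attn2`) and feed-forward (`.ff.`) modules.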
`preview.yml`
- prompt: |-
    pvc style, 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings,
    green background, hat, hoop earrings, jewelry, looking at viewer,
    shirt, short hair, simple background, solo, upper body, yellow shirt,
    masterpiece, high score, great score, absurdres
  negative_prompt: |-
    lowres, bad anatomy, bad hands, text, error, missing finger, extra digits,
    fewer digits, cropped, worst quality, low quality, low score, bad score,
    average score, signature, watermark, username, blurry
  height: 1024
  width: 1024
  cfg_scale: 5.0
  num_steps: 20
  seed: 0
Training was stopped at epoch 9 because the room overheated during the run.
Hardware and duration
Training took 27 hours on an RTX 4070 Ti Super (16 GB VRAM).
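As a rough sanity check, the dataset size, `num_repeats`, `batch_size`, and the 27-hour run imply an approximate step count and time per step. The sketch assumes exactly 7,000 images, that each epoch visits every image `num_repeats` times, and that 9 full epochs were completed; the real figures may differ, since bucketing can leave partial batches.

```python
import math

NUM_IMAGES = 7000         # "about 7,000" in this README; the exact count is an assumption
NUM_REPEATS = 4           # dataset.num_repeats
BATCH_SIZE = 4            # dataset.batch_size
EPOCHS_RUN = 9            # assumption: 9 epochs completed before the stop
SAVE_EVERY_EPOCHS = 0.25  # saving.strategy.per_epochs
TRAIN_HOURS = 27

# One optimizer step consumes one batch.
steps_per_epoch = math.ceil(NUM_IMAGES * NUM_REPEATS / BATCH_SIZE)
total_steps = steps_per_epoch * EPOCHS_RUN
steps_between_saves = int(steps_per_epoch * SAVE_EVERY_EPOCHS)
seconds_per_step = TRAIN_HOURS * 3600 / total_steps

print(f"steps per epoch:     {steps_per_epoch}")
print(f"total steps:         {total_steps}")
print(f"steps between saves: {steps_between_saves}")
print(f"seconds per step:    {seconds_per_step:.2f}")
```

Under these assumptions the run comes to roughly 63,000 steps at about 1.5 seconds per step, with a checkpoint about every 1,750 steps.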
📄 License
This project is licensed under the Apache-2.0 license.