🚀 PVC Style LoRA for AnimagineXL 4.0 opt
This is a PVC-style LoRA for AnimagineXL 4.0. It generates images that look like PVC figures.
🚀 Quick Start
This model has only been tested in ComfyUI. To generate images with it, follow the steps below.
✨ Key Features
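- Trigger words: use `pvc`, `nendoroid`, or `figma` as trigger words to generate images in the corresponding style.
- Format: the model weights are provided in Safetensors format.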
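📦 Installation
The model weights can be downloaded from the Files & versions tab.
💻 Usage Examples
Basic usage
In ComfyUI, enter the following trigger words in the prompt to generate images: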
pvc, nendoroid, figma
Advanced usage
Combine the trigger words with more descriptive tags to generate richer images, for example:
1girl, solo, blue background, halftone background, looking at viewer, animal ears, school uniform, yellow eyes, black hair, long hair, sitting, crossed legs, cat ears, border, halftone, white border, couch, open mouth, shirt, skirt, long sleeves, white shirt, black skirt, pleated skirt, serafuku, neckerchief, black sailor collar, socks, white socks, outside border, sailor collar, animal ear fluff, black neckerchief, masterpiece, best quality, high score, great score, latest, figma
A negative prompt can also be set to avoid undesirable results:
lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, fewer digits, cropped, worst quality, low quality, low score, bad score, average score, signature, watermark, username, blurry
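The prompt layout used above (description tags, then quality tags, then one trigger word) can be sketched as a small helper. This is only an illustration: `build_prompt`, `QUALITY_TAGS`, and `TRIGGER_WORDS` are hypothetical names that are not part of ComfyUI or this model, and the resulting string would still need to be pasted into a ComfyUI text-encode node by hand.

```python
# Hypothetical helper for composing prompts; not part of ComfyUI or this model.
TRIGGER_WORDS = ("pvc", "nendoroid", "figma")

# Quality tags copied from the advanced-usage example above.
QUALITY_TAGS = ["masterpiece", "best quality", "high score", "great score", "latest"]


def build_prompt(tags, trigger="figma"):
    """Join description tags, quality tags, and one trigger word with commas."""
    if trigger not in TRIGGER_WORDS:
        raise ValueError(f"unknown trigger word: {trigger!r}")
    return ", ".join([*tags, *QUALITY_TAGS, trigger])


prompt = build_prompt(["1girl", "solo", "cat ears", "school uniform"])
print(prompt)
```

Placing the trigger word last mirrors the example prompt above; whether its position affects the result has not been tested here.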
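📚 Documentation
Dataset
About 7,000 images of PVC figures were used as the training dataset.
Training
Training code
The training code is available at: https://github.com/p1atdev/vision-ft
Training settings
The full training settings are in the following configuration files: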
`config.yml`
model:
  checkpoint_path: "./models/animagine-xl-4.0-opt.bnb_nf4.safetensors"
  pretrained_model_name_or_path: cagliostrolab/animagine-xl-4.0
  dtype: bfloat16
  denoiser:
    attn_implementation: "flash_attention_2"
peft:
  config:
    type: lora
    rank: 2
    alpha: 1.0
    dropout: 0.0
    dtype: bfloat16
  include_keys:
    - "attn1"
    - "attn2"
    - ".ff."
  exclude_keys: ["text_encoder", "vae"]
dataset:
  folder: "data/pvc"
  num_repeats: 4
  batch_size: 4
  bucket_base_size: 1024
  step: 128
  min_size: 384
  do_upscale: true
  caption_processors:
    - type: shuffle
      split_separator: ","
optimizer:
  name: "schedulefree.RAdamScheduleFree"
  args:
    lr: 0.005
scheduler:
tracker:
  project_name: "sdxl-pvc-1"
  loggers:
    - wandb
saving:
  strategy:
    per_epochs: 0.25
    per_steps: null
    save_last: true
  callbacks:
    - type: "safentensors"
      name: "sdxl-pvc"
      save_dir: "./output/sdxl-pvc-2"
preview:
  strategy:
    per_epochs: 1
    per_steps: 100
  callbacks:
    - type: "local"
      save_dir: "./output/sdxl-lora/preview"
  data:
    path: "./projects/pvc-sdxl/preview.yml"
seed: 42
num_train_epochs: 10
trainer:
  gradient_checkpointing: true
  torch_compile: true
  torch_compile_args:
    backend: eager
    mode: default
    fullgraph: false
  fp32_matmul_precision: "high"
  allow_tf32: true
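To give a feel for what `rank: 2` and `alpha: 1.0` imply, here is a minimal sketch based on the commonly used LoRA formulation, in which the low-rank update is scaled by `alpha / rank` and each adapted linear layer gains two small matrices. Whether vision-ft applies exactly this scaling is an assumption, and the layer width of 1280 is only an illustrative value, not a figure taken from the training run.

```python
def lora_scale(alpha: float, rank: int) -> float:
    """Scaling factor applied to the low-rank update in the common LoRA formulation."""
    return alpha / rank


def lora_param_count(in_features: int, out_features: int, rank: int) -> int:
    """Extra parameters for one linear layer: A is (rank x in), B is (out x rank)."""
    return rank * (in_features + out_features)


# Values from config.yml above.
scale = lora_scale(alpha=1.0, rank=2)

# 1280 is an illustrative layer width (assumption), not read from the model.
params = lora_param_count(1280, 1280, rank=2)

print(scale, params)
```

A rank this low keeps the adapter file small, which fits a style LoRA that only targets the `attn1`, `attn2`, and `.ff.` modules.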
`preview.yml`
- prompt: |-
    pvc style, 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings,
    green background, hat, hoop earrings, jewelry, looking at viewer,
    shirt, short hair, simple background, solo, upper body, yellow shirt,
    masterpiece, high score, great score, absurdres
  negative_prompt: |-
    lowres, bad anatomy, bad hands, text, error, missing finger, extra digits,
    fewer digits, cropped, worst quality, low quality, low score, bad score,
    average score, signature, watermark, username, blurry
  height: 1024
  width: 1024
  cfg_scale: 5.0
  num_steps: 20
  seed: 0
Training was stopped at epoch 9 because the room became too hot.
Hardware and duration
Training took 27 hours on an RTX 4070 Ti Super (16 GB VRAM).
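The figures above allow a rough back-of-the-envelope estimate of the training schedule. The sketch below rests on several assumptions: the dataset is taken as exactly 7,000 images (the card only says "about"), `num_repeats` is read as each image being seen 4 times per epoch, bucketing is ignored, and roughly 9 full epochs are assumed to have completed in the 27 hours. None of these numbers come from training logs.

```python
import math


def steps_per_epoch(num_images: int, num_repeats: int, batch_size: int) -> int:
    """Optimizer steps per epoch if every image is repeated num_repeats times."""
    return math.ceil(num_images * num_repeats / batch_size)


NUM_IMAGES = 7000  # approximate dataset size (assumption: exactly 7,000)

per_epoch = steps_per_epoch(NUM_IMAGES, num_repeats=4, batch_size=4)

# saving.strategy.per_epochs is 0.25, i.e. a checkpoint every quarter epoch.
save_every = round(per_epoch * 0.25)

# Rough time per step, assuming about 9 full epochs ran in 27 hours.
sec_per_step = 27 * 3600 / (9 * per_epoch)

print(per_epoch, save_every, round(sec_per_step, 2))
```

Treat the result as an order-of-magnitude figure only; the real step count depends on how vision-ft buckets and batches the images.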
📄 License
This project is licensed under Apache-2.0.