animagine-xl-4.0-opt-pvc-style-lora open-source model: free generation of anime character images in PVC figure style

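Animagine Xl 4.0 Opt Pvc Style Lora

Developed by p1atdev

A LoRA model built on AnimagineXL 4.0 opt, designed to generate anime character images in PVC figure style

Task: image generation. License: Apache-2.0. Tags: #PVC figure style #anime character generation #LoRA fine-tuning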

Downloads: 63

Published: 4/10/2025

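Model Overview

This model is a LoRA for the optimized (opt) variant of AnimagineXL 4.0. It is fine-tuned with LoRA to generate high-quality anime character images in PVC figure style, and it supports several trigger words that produce different figure styles.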

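Model Features

PVC style generation

Optimized specifically for generating anime character images in PVC figure style

Multiple trigger words

Supports the trigger words pvc, nendoroid, and figma, each producing a different figure style

High-quality output

Works with quality tags such as masterpiece, best quality, and high score to steer output quality

LoRA fine-tuning

Uses low-rank adaptation (LoRA) to fine-tune the base model efficiently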
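
As a rough illustration of what low-rank adaptation means, the sketch below applies the standard LoRA update, W_eff = W + (alpha / rank) * B·A, to a toy weight matrix in pure Python. It uses rank 2 and alpha 1.0 to match the values in the training config further down; the 3x3 matrices and the 1280-wide layer in the parameter count are illustrative assumptions and are not taken from the model.

```python
# Toy sketch of the standard LoRA update: W_eff = W + (alpha / rank) * (B @ A).
# All matrices here are made-up values for illustration only.

def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]


def lora_effective_weight(w, a, b, alpha, rank):
    """Return W + (alpha / rank) * (B @ A) without modifying W."""
    scale = alpha / rank
    delta = matmul(b, a)  # (out, rank) @ (rank, in) -> (out, in)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def lora_param_count(in_features, out_features, rank):
    """Trainable parameters added by one LoRA pair (A and B)."""
    return rank * (in_features + out_features)


# Frozen 3x3 weight (identity) with a rank-2 adapter.
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]    # shape (rank=2, in=3)
B = [[2.0, 0.0], [0.0, 4.0], [0.0, 0.0]]  # shape (out=3, rank=2)

W_eff = lora_effective_weight(W, A, B, alpha=1.0, rank=2)

# For a hypothetical 1280x1280 linear layer, rank 2 trains far fewer values
# than the full weight matrix would.
adapter_params = lora_param_count(1280, 1280, rank=2)
full_params = 1280 * 1280
```

With alpha 1.0 and rank 2 the low-rank delta is scaled by 0.5 before it is added to the frozen weight, which is why small ranks keep both the file size and the strength of the adaptation modest.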

Model Capabilities

Text-to-image generation

Anime character generation

PVC figure style conversion

High-quality image generation

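Use Cases

Digital art creation

Anime figure concept design

Design PVC figure concept art for anime characters

Generates high-quality figure-style images

Game development

Turning game characters into figures

Convert game characters into figure style for merchandise design

Generates figure-style images that can be used in commercial products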

🚀 PVC Style LoRA for AnimagineXL 4.0 opt

This project is a PVC-style LoRA for AnimagineXL 4.0 opt that generates anime character images with the look of PVC figures.

🚀 Quick Start

This model has only been tested in ComfyUI. To generate images with it, follow the steps below.

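✨ Key Features

Trigger words: use pvc, nendoroid, or figma as the trigger word to generate images in the corresponding style.
Format: the model weights are provided in Safetensors format.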

📦 Installation

The model weights can be downloaded from the Files & versions tab.

💻 Usage Examples

Basic usage

In ComfyUI, include one of the following trigger words in the prompt:

pvc, nendoroid, figma

Advanced usage

Combine the trigger word with more descriptive tags to generate richer images, for example:

1girl, solo, blue background, halftone background, looking at viewer, animal ears, school uniform, yellow eyes, black hair, long hair, sitting, crossed legs, cat ears, border, halftone, white border, couch, open mouth, shirt, skirt, long sleeves, white shirt, black skirt, pleated skirt, serafuku, neckerchief, black sailor collar, socks, white socks, outside border, sailor collar, animal ear fluff, black neckerchief, masterpiece, best quality, high score, great score, latest, figma

A negative prompt can also be set to avoid undesirable results:

lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, fewer digits, cropped, worst quality, low quality, low score, bad score, average score, signature, watermark, username, blurry
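
The prompt layout above (descriptive tags, then quality tags, then the trigger word last) can be assembled programmatically. The following is a minimal sketch; `build_prompt` is a hypothetical helper, not part of ComfyUI or of this model, and the tag order simply mirrors the example prompt. Whether the order affects the result is not stated in the source.

```python
# Hypothetical helper that assembles a prompt in the same layout as the example:
# descriptive tags, then quality tags, then one trigger word at the end.

QUALITY_TAGS = ["masterpiece", "best quality", "high score", "great score", "latest"]
TRIGGERS = ("pvc", "nendoroid", "figma")

# Negative prompt copied from the example above.
NEGATIVE_PROMPT = (
    "lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, "
    "fewer digits, cropped, worst quality, low quality, low score, bad score, "
    "average score, signature, watermark, username, blurry"
)


def build_prompt(tags, trigger="figma"):
    """Join tags, quality tags, and a trigger word, dropping duplicates."""
    if trigger not in TRIGGERS:
        raise ValueError(f"unknown trigger word: {trigger!r}")
    seen = set()
    ordered = []
    for tag in list(tags) + QUALITY_TAGS + [trigger]:
        tag = tag.strip()
        if tag and tag not in seen:
            seen.add(tag)
            ordered.append(tag)
    return ", ".join(ordered)


prompt = build_prompt(["1girl", "solo", "cat ears"], trigger="nendoroid")
```

The resulting `prompt` and `NEGATIVE_PROMPT` strings can be pasted into the positive and negative text fields of a ComfyUI workflow.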

📚 Documentation

Dataset

About 7,000 images of PVC figures were used as the training dataset.
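
To get a feel for the scale of training, the dataset size can be combined with the values from `config.yml` below (`num_repeats: 4`, `batch_size: 4`, `num_train_epochs: 10`, `per_epochs: 0.25`). The sketch rests on several assumptions: exactly 7,000 images (the source only says "about"), `num_repeats` multiplying the samples seen per epoch, a single device without gradient accumulation, and bucketing that neither drops nor pads samples. The real step counts may differ.

```python
import math


def training_schedule(num_images, num_repeats, batch_size, num_epochs, save_per_epochs):
    """Estimate optimizer steps under the assumptions stated above."""
    samples_per_epoch = num_images * num_repeats
    steps_per_epoch = math.ceil(samples_per_epoch / batch_size)
    return {
        "samples_per_epoch": samples_per_epoch,
        "steps_per_epoch": steps_per_epoch,
        "total_steps": steps_per_epoch * num_epochs,
        # A checkpoint every `save_per_epochs` of an epoch.
        "steps_between_saves": round(steps_per_epoch * save_per_epochs),
    }


schedule = training_schedule(
    num_images=7000,       # approximate figure from the dataset description
    num_repeats=4,         # dataset.num_repeats
    batch_size=4,          # dataset.batch_size
    num_epochs=10,         # num_train_epochs
    save_per_epochs=0.25,  # saving.strategy.per_epochs
)
```

Under these assumptions the repeats and the batch size cancel out, so one epoch is roughly as many steps as there are images.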

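Training

Training code

The training code is available at: https://github.com/p1atdev/vision-ft

Training settings

The training settings are detailed in the following configuration files: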

`config.yml`

model:
  checkpoint_path: "./models/animagine-xl-4.0-opt.bnb_nf4.safetensors"
  pretrained_model_name_or_path: cagliostrolab/animagine-xl-4.0

  dtype: bfloat16

  denoiser:
    attn_implementation: "flash_attention_2"

peft:
  config:
    type: lora
    rank: 2
    alpha: 1.0
    dropout: 0.0

    dtype: bfloat16

  include_keys:
    - "attn1"
    - "attn2"
    - ".ff."
  exclude_keys: ["text_encoder", "vae"]

dataset:
  folder: "data/pvc"
  num_repeats: 4
  batch_size: 4

  bucket_base_size: 1024
  step: 128
  min_size: 384
  do_upscale: true

  caption_processors:
    - type: shuffle
      split_separator: ","

optimizer:
  name: "schedulefree.RAdamScheduleFree"
  args:
    lr: 0.005

scheduler:
  # name: "torch.optim.lr_scheduler.ConstantLR"
  # args: {}

tracker:
  project_name: "sdxl-pvc-1"
  loggers:
    - wandb

saving:
  strategy:
    per_epochs: 0.25
    per_steps: null
    save_last: true

  callbacks:
    - type: "safentensors"
      name: "sdxl-pvc"
      save_dir: "./output/sdxl-pvc-2"

preview:
  strategy:
    per_epochs: 1
    per_steps: 100

  callbacks:
    - type: "local"
      save_dir: "./output/sdxl-lora/preview"

  data:
    path: "./projects/pvc-sdxl/preview.yml"

seed: 42
num_train_epochs: 10

trainer:
  # debug_mode: "1step"

  gradient_checkpointing: true

  torch_compile: true
  torch_compile_args:
    backend: eager
    mode: default
    fullgraph: false

  fp32_matmul_precision: "high"
  allow_tf32: true
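
The `include_keys` and `exclude_keys` entries in the `peft` section decide which modules receive LoRA adapters. The sketch below shows one plausible reading: plain substring matching on module names, with exclusions taking precedence. This is an assumption about how such a filter could behave, not a description of the vision-ft implementation, and the module names are hypothetical examples.

```python
# Assumed semantics: a module is adapted when its name contains any include key
# and contains no exclude key. The actual vision-ft logic may differ.

INCLUDE_KEYS = ["attn1", "attn2", ".ff."]
EXCLUDE_KEYS = ["text_encoder", "vae"]


def is_lora_target(module_name, include_keys=INCLUDE_KEYS, exclude_keys=EXCLUDE_KEYS):
    """Return True when the module name passes the include/exclude filter."""
    if any(key in module_name for key in exclude_keys):
        return False
    return any(key in module_name for key in include_keys)


# Hypothetical module names, made up for illustration.
candidates = [
    "denoiser.down_blocks.1.attentions.0.transformer_blocks.0.attn1.to_q",
    "denoiser.mid_block.attentions.0.transformer_blocks.0.attn2.to_k",
    "denoiser.up_blocks.0.attentions.1.transformer_blocks.0.ff.net.0.proj",
    "denoiser.down_blocks.0.resnets.0.conv1",
    "text_encoder.layers.0.attn1.q_proj",
    "vae.decoder.mid_block.attentions.0.to_q",
]

targets = [name for name in candidates if is_lora_target(name)]
```

Read this way, the config restricts training to the self-attention, cross-attention, and feed-forward layers of the denoiser and leaves the text encoders and the VAE frozen.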

`preview.yml`

- prompt: |-
    pvc style, 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, 
    green background, hat, hoop earrings, jewelry, looking at viewer, 
    shirt, short hair, simple background, solo, upper body, yellow shirt, 
    masterpiece, high score, great score, absurdres
  negative_prompt: |-
    lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, 
    fewer digits, cropped, worst quality, low quality, low score, bad score, 
    average score, signature, watermark, username, blurry
  height: 1024
  width: 1024
  cfg_scale: 5.0
  num_steps: 20

  seed: 0
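
The preview settings can be reused outside the training code. As a hedged sketch, the function below maps one `preview.yml` entry onto the keyword names that the diffusers `StableDiffusionXLPipeline` call accepts (`guidance_scale`, `num_inference_steps`, and so on). The model has only been reported as tested in ComfyUI, so using it with diffusers is an assumption; `preview_to_pipeline_kwargs` is a hypothetical helper, and the entry is written as a Python dict here to avoid a YAML dependency.

```python
def preview_to_pipeline_kwargs(entry):
    """Translate one preview.yml entry into diffusers-style call arguments.

    Returns (kwargs, seed); the seed is kept separate because diffusers takes
    it through a torch.Generator rather than a plain integer.
    """

    def flatten(text):
        # YAML block scalars span several lines; join them into one prompt.
        return " ".join(line.strip() for line in text.splitlines() if line.strip())

    kwargs = {
        "prompt": flatten(entry["prompt"]),
        "negative_prompt": flatten(entry["negative_prompt"]),
        "height": entry["height"],
        "width": entry["width"],
        "guidance_scale": entry["cfg_scale"],
        "num_inference_steps": entry["num_steps"],
    }
    return kwargs, entry["seed"]


# The preview.yml entry above, written out as a dict.
preview_entry = {
    "prompt": (
        "pvc style, 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, \n"
        "green background, hat, hoop earrings, jewelry, looking at viewer, \n"
        "shirt, short hair, simple background, solo, upper body, yellow shirt, \n"
        "masterpiece, high score, great score, absurdres"
    ),
    "negative_prompt": (
        "lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, \n"
        "fewer digits, cropped, worst quality, low quality, low score, bad score, \n"
        "average score, signature, watermark, username, blurry"
    ),
    "height": 1024,
    "width": 1024,
    "cfg_scale": 5.0,
    "num_steps": 20,
    "seed": 0,
}

pipeline_kwargs, seed = preview_to_pipeline_kwargs(preview_entry)
```

In a diffusers script, `pipeline_kwargs` would be unpacked into the pipeline call and `seed` passed through a seeded `torch.Generator`; in ComfyUI the same values correspond to the sampler's CFG, steps, and seed fields and to the latent image size.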