Flux.1 Schnell Pvc Style Lora
基於FLUX.1-schnell基礎模型訓練的LoRA適配器,專門用於生成PVC手辦風格的動漫角色圖像
下載量 162
發布時間 : 8/20/2024
模型概述
該模型通過LoRA技術適配FLUX.1-schnell基礎模型,能夠根據文本描述生成高質量的PVC手辦風格圖像,特別適合動漫角色創作。
模型特點
PVC手辦風格生成
專門優化用於生成PVC手辦、Q版黏土人和figma可動模型風格的動漫角色圖像
LoRA技術適配
採用LoRA(Low-Rank Adaptation)技術對基礎模型進行輕量級適配,保持高質量輸出的同時減少計算資源需求
觸發詞控制
支持通過特定觸發詞(如'pvc手辦'、'Q版黏土人')精確控制生成風格
模型能力
動漫角色生成
PVC手辦風格轉換
Q版角色生成
基於文本描述的圖像生成
使用案例
動漫創作
角色概念設計
快速生成動漫角色的PVC手辦風格概念圖
示例圖片展示了不同風格和姿勢的動漫角色
周邊產品設計
為手辦、黏土人等周邊產品生成設計原型
可生成適合商業生產的標準化角色圖像
🚀 FLUX.1 schnell PVC風格
這是一個基於文本生成圖像的項目,利用LoRA技術,基於diffusers
庫實現。該項目可以根據輸入的文本描述生成具有PVC風格的圖像。
示例展示
以下是一些輸入文本及其對應的生成圖像示例:
輸入文本 | 輸出圖像 |
---|---|
1girl is standing leftside by a blackboard rightside. pvc style. from front shot, 1girl; 1girl, nendoroid, blue hair, medium hair, cat ears, looking at viewer, white dress shirt, black shorts, arm up, open mouth, chibi. blackboard; blackboard with wooden frame and feet, text of "schnell LoRA workflow" on the board. | 點擊查看 |
1girl, solo, full body, gloves, groin, hair between eyes, hair ornament, head tilt, holding, holding staff, horns, long hair, looking at viewer, maid headdress, mole, mole under eye, navel, parted lips, purple eyes, purple hair, purple theme, shawl, shorts, single horn, staff, standing, thighhighs, twintails, white footwear, white gloves, white thighhighs | 點擊查看 |
1girl, solo, blue hair, cat ears, parted bangs, long hair, looking at viewer, white dress shirt, rainy, sitting | 點擊查看 |
cute anime girl with massive fluffy fennec ears and a big fluffy tail blonde messy long hair blue eyes wearing a maid outfit with a long black gold leaf pattern dress and a white apron mouth open placing a fancy black forest cake with candles on top of a dinner table of an old dark Victorian mansion lit by candlelight with a bright window to the foggy forest and very expensive stuff everywhere there are paintings on the walls | 點擊查看 |
1girl, solo, maid, cowboy shot, cup, green eyes, green hair, hair intakes, hair ornament, holding, holding cup, lips, long hair, looking at viewer, ponytail, realistic, simple background, teacup, very long hair, white background, wrist cuffs | 點擊查看 |
1girl, hatsune miku, vocaloid, solo, :o, bare shoulders, black skirt, black sleeves, black thighhighs, blue eyes, blue hair, blush, collared shirt, detached sleeves, hair ornament, kneeling, long hair, long sleeves, looking at viewer, miniskirt, parted lips, pleated skirt, shirt, skirt, sleeveless, sleeveless shirt, thighhighs, twintails, very long hair, white shirt, wing collar | 點擊查看 |
1girl, solo, outdoors, looking at viewer, flower, gloves, grey hair, hat, jacket, long hair, long skirt, long sleeves, looking at viewer, open clothes, open jacket, pantyhose, red eyes, red flower, red hat, red jacket, red rose, red skirt, rose, shirt, skirt, smile, snowing, standing, white gloves, white shirt | 點擊查看 |
✨ 主要特性
- 文本生成圖像:根據輸入的文本描述生成具有PVC風格的圖像。
- LoRA技術:利用低秩自適應(LoRA)技術進行模型訓練,提高訓練效率。
- 基於
diffusers
庫:使用diffusers
庫實現圖像生成功能。
📦 安裝指南
文檔中未提及具體安裝步驟,故跳過此章節。
💻 使用示例
觸發詞使用
你可以使用以下觸發詞來觸發圖像生成:
pvc figure
nendoroid
figma
提示指南
此LoRA模型使用danbooru標籤進行訓練,但並非所有danbooru標籤都能生效。建議使用自然語言進行輸入。
📚 詳細文檔
模型描述
這是一個用於FLUX.1-schnell的PVC風格LoRA模型。它基於FLUX.1-schnell模型,使用訓練適配器和ostris/ai-toolkit進行訓練。
下載模型
該模型的權重以Safetensors格式提供。你可以在文件與版本標籤中下載。
訓練配置
- WandB日誌:點擊查看
- 簡要配置:
learning_rate
:1e-4
,採用constant
策略,使用AdamW8bit
優化器(默認)- 約使用2500張圖像進行訓練
batch_size
:4
,gradient_accumulation_steps
:1
,約需32GB顯存- LoRA配置(默認):
linear
:16
linear_alpha
:16
- 基礎模型:black-forest-labs/FLUX.1-schnell
- 輔助LoRA模型:ostris/FLUX.1-schnell-training-adapter
- 訓練步數:10,000步,最終使用第7250步和第7500步的檢查點以1:1的比例合併
- GPU:A6000 x1(顯存48GB)
完整的config.yaml文件
job: extension
config:
name: flux_lora_pvc_schnell_1
process:
- type: sd_trainer
training_folder: output
device: cuda:3
network:
type: lora
linear: 16
linear_alpha: 16
save:
dtype: bfloat16
save_every: 250
max_step_saves_to_keep: 10
datasets:
- folder_path: /workspace/ai-toolkit/dataset/pvc
caption_ext: txt
caption_dropout_rate: 0.01
shuffle_tokens: true
cache_latents_to_disk: true
resolution:
- 768
- 1024
train:
batch_size: 4
steps: 10000
gradient_accumulation_steps: 1
train_unet: true
train_text_encoder: false
gradient_checkpointing: true
noise_scheduler: flowmatch
optimizer: adamw8bit
lr: 0.0001
ema_config:
use_ema: true
ema_decay: 0.99
dtype: bf16
model:
name_or_path: black-forest-labs/FLUX.1-schnell
assistant_lora_path: ostris/FLUX.1-schnell-training-adapter
is_flux: true
quantize: true
sample:
sampler: flowmatch
sample_every: 250
width: 832
height: 1152
prompts:
- 1girl, solo, blue hair, cat ears, parted bangs, long hair, looking at viewer,
white dress shirt, rainy, wariza, sitting
- 1girl, solo, full body, gloves, groin, hair between eyes, hair ornament, head
tilt, holding, holding staff, horns, long hair, looking at viewer, maid headdress,
mole, mole under eye, navel, parted lips, purple eyes, purple hair, purple
theme, shawl, shorts, single horn, staff, standing, thighhighs, twintails,
white footwear, white gloves, white thighhighs
- 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, green
background, hat, hoop earrings, jewelry, looking at viewer, shirt, short hair,
solo, upper body, yellow shirt,
- 1girl, bangs, bare shoulders, beret, black hair, black shorts, blue hair,
bracelet, breasts, buttons, colored inner hair, double-breasted, eyewear removed,
green headwear, green jacket, grey eyes, grey sky, hat, jacket, jewelry, long
hair, looking at viewer, multicolored hair, neck ring, o-ring, off shoulder,
rain, round eyewear, shorts, sidelocks, small breasts, solo, sunglasses, wavy
hair, wet, zipper,
- 1girl, brown hair, green eyes, colorful, autumn, cumulonimbus clouds, lighting,
blue sky, falling leaves, garden
- 1girl, bangs, black wings, blush, choker, collarbone, feathered wings, hair
between eyes, halo, hand on hip, hat, head wings, long hair, long sleeves,
looking at viewer, pink eyes, pink hair, pleated skirt, school hat, school
uniform, serafuku, sidelocks, simple background, skirt, solo, twintails, wavy
mouth, white background, wings, waifu,
neg: ''
seed: 42
walk_seed: true
guidance_scale: 1
sample_steps: 4
logging:
use_wandb: true
project_name: flux_lora_pvc_schnell_1
run_name: run-1
meta:
name: flux_lora_schnell_pvc
version: '1.0'
📄 許可證
本項目採用Apache 2.0許可證。
Stable Diffusion V1 5
Openrail
穩定擴散是一種潛在的文本到圖像擴散模型,能夠根據任何文本輸入生成逼真的圖像。
圖像生成
S
stable-diffusion-v1-5
3.7M
518
Stable Diffusion Inpainting
Openrail
基於穩定擴散的文本到圖像生成模型,具備圖像修復能力
圖像生成
S
stable-diffusion-v1-5
3.3M
56
Stable Diffusion Xl Base 1.0
SDXL 1.0是基於擴散的文本生成圖像模型,採用專家集成的潛在擴散流程,支持高分辨率圖像生成
圖像生成
S
stabilityai
2.4M
6,545
Stable Diffusion V1 4
Openrail
穩定擴散是一種潛在文本到圖像擴散模型,能夠根據任意文本輸入生成逼真圖像。
圖像生成
S
CompVis
1.7M
6,778
Stable Diffusion Xl Refiner 1.0
SD-XL 1.0優化器模型是Stability AI開發的圖像生成模型,專為提升SDXL基礎模型生成的圖像質量而設計,特別擅長最終去噪步驟處理。
圖像生成
S
stabilityai
1.1M
1,882
Stable Diffusion 2 1
基於擴散的文本生成圖像模型,支持通過文本提示生成和修改圖像
圖像生成
S
stabilityai
948.75k
3,966
Stable Diffusion Xl 1.0 Inpainting 0.1
基於Stable Diffusion XL的潛在文本到圖像擴散模型,具備通過遮罩進行圖像修復的功能
圖像生成
S
diffusers
673.14k
334
Stable Diffusion 2 Base
基於擴散的文生圖模型,可根據文本提示生成高質量圖像
圖像生成
S
stabilityai
613.60k
349
Playground V2.5 1024px Aesthetic
其他
開源文生圖模型,能生成1024x1024分辨率及多種縱橫比的美學圖像,在美學質量上處於開源領域領先地位。
圖像生成
P
playgroundai
554.94k
723
Sd Turbo
SD-Turbo是一款高速文本生成圖像模型,僅需單次網絡推理即可根據文本提示生成逼真圖像。該模型作為研究原型發佈,旨在探索小型蒸餾文本生成圖像模型。
圖像生成
S
stabilityai
502.82k
380
精選推薦AI模型
Llama 3 Typhoon V1.5x 8b Instruct
專為泰語設計的80億參數指令模型,性能媲美GPT-3.5-turbo,優化了應用場景、檢索增強生成、受限生成和推理任務
大型語言模型
Transformers 支持多種語言

L
scb10x
3,269
16
Cadet Tiny
Openrail
Cadet-Tiny是一個基於SODA數據集訓練的超小型對話模型,專為邊緣設備推理設計,體積僅為Cosmo-3B模型的2%左右。
對話系統
Transformers 英語

C
ToddGoldfarb
2,691
6
Roberta Base Chinese Extractive Qa
基於RoBERTa架構的中文抽取式問答模型,適用於從給定文本中提取答案的任務。
問答系統 中文
R
uer
2,694
98