Flux.1 Schnell Pvc Style Lora
基于FLUX.1-schnell基础模型训练的LoRA适配器,专门用于生成PVC手办风格的动漫角色图像
下载量 162
发布时间 : 8/20/2024
模型简介
该模型通过LoRA技术适配FLUX.1-schnell基础模型,能够根据文本描述生成高质量的PVC手办风格图像,特别适合动漫角色创作。
模型特点
PVC手办风格生成
专门优化用于生成PVC手办、Q版黏土人和figma可动模型风格的动漫角色图像
LoRA技术适配
采用LoRA(Low-Rank Adaptation)技术对基础模型进行轻量级适配,保持高质量输出的同时减少计算资源需求
触发词控制
支持通过特定触发词(如'pvc手办'、'Q版黏土人')精确控制生成风格
模型能力
动漫角色生成
PVC手办风格转换
Q版角色生成
基于文本描述的图像生成
使用案例
动漫创作
角色概念设计
快速生成动漫角色的PVC手办风格概念图
示例图片展示了不同风格和姿势的动漫角色
周边产品设计
为手办、黏土人等周边产品生成设计原型
可生成适合商业生产的标准化角色图像
🚀 FLUX.1 schnell PVC风格
这是一个基于文本生成图像的项目,利用LoRA技术,基于diffusers
库实现。该项目可以根据输入的文本描述生成具有PVC风格的图像。
示例展示
以下是一些输入文本及其对应的生成图像示例:
输入文本 | 输出图像 |
---|---|
1girl is standing leftside by a blackboard rightside. pvc style. from front shot, 1girl; 1girl, nendoroid, blue hair, medium hair, cat ears, looking at viewer, white dress shirt, black shorts, arm up, open mouth, chibi. blackboard; blackboard with wooden frame and feet, text of "schnell LoRA workflow" on the board. | 点击查看 |
1girl, solo, full body, gloves, groin, hair between eyes, hair ornament, head tilt, holding, holding staff, horns, long hair, looking at viewer, maid headdress, mole, mole under eye, navel, parted lips, purple eyes, purple hair, purple theme, shawl, shorts, single horn, staff, standing, thighhighs, twintails, white footwear, white gloves, white thighhighs | 点击查看 |
1girl, solo, blue hair, cat ears, parted bangs, long hair, looking at viewer, white dress shirt, rainy, sitting | 点击查看 |
cute anime girl with massive fluffy fennec ears and a big fluffy tail blonde messy long hair blue eyes wearing a maid outfit with a long black gold leaf pattern dress and a white apron mouth open placing a fancy black forest cake with candles on top of a dinner table of an old dark Victorian mansion lit by candlelight with a bright window to the foggy forest and very expensive stuff everywhere there are paintings on the walls | 点击查看 |
1girl, solo, maid, cowboy shot, cup, green eyes, green hair, hair intakes, hair ornament, holding, holding cup, lips, long hair, looking at viewer, ponytail, realistic, simple background, teacup, very long hair, white background, wrist cuffs | 点击查看 |
1girl, hatsune miku, vocaloid, solo, :o, bare shoulders, black skirt, black sleeves, black thighhighs, blue eyes, blue hair, blush, collared shirt, detached sleeves, hair ornament, kneeling, long hair, long sleeves, looking at viewer, miniskirt, parted lips, pleated skirt, shirt, skirt, sleeveless, sleeveless shirt, thighhighs, twintails, very long hair, white shirt, wing collar | 点击查看 |
1girl, solo, outdoors, looking at viewer, flower, gloves, grey hair, hat, jacket, long hair, long skirt, long sleeves, looking at viewer, open clothes, open jacket, pantyhose, red eyes, red flower, red hat, red jacket, red rose, red skirt, rose, shirt, skirt, smile, snowing, standing, white gloves, white shirt | 点击查看 |
✨ 主要特性
- 文本生成图像:根据输入的文本描述生成具有PVC风格的图像。
- LoRA技术:利用低秩自适应(LoRA)技术进行模型训练,提高训练效率。
- 基于
diffusers
库:使用diffusers
库实现图像生成功能。
📦 安装指南
文档中未提及具体安装步骤,故跳过此章节。
💻 使用示例
触发词使用
你可以使用以下触发词来触发图像生成:
pvc figure
nendoroid
figma
提示指南
此LoRA模型使用danbooru标签进行训练,但并非所有danbooru标签都能生效。建议使用自然语言进行输入。
📚 详细文档
模型描述
这是一个用于FLUX.1-schnell的PVC风格LoRA模型。它基于FLUX.1-schnell模型,使用训练适配器和ostris/ai-toolkit进行训练。
下载模型
该模型的权重以Safetensors格式提供。你可以在文件与版本标签中下载。
训练配置
- WandB日志:点击查看
- 简要配置:
learning_rate
:1e-4
,采用constant
策略,使用AdamW8bit
优化器(默认)- 约使用2500张图像进行训练
batch_size
:4
,gradient_accumulation_steps
:1
,约需32GB显存- LoRA配置(默认):
linear
:16
linear_alpha
:16
- 基础模型:black-forest-labs/FLUX.1-schnell
- 辅助LoRA模型:ostris/FLUX.1-schnell-training-adapter
- 训练步数:10,000步,最终使用第7250步和第7500步的检查点以1:1的比例合并
- GPU:A6000 x1(显存48GB)
完整的config.yaml文件
job: extension
config:
name: flux_lora_pvc_schnell_1
process:
- type: sd_trainer
training_folder: output
device: cuda:3
network:
type: lora
linear: 16
linear_alpha: 16
save:
dtype: bfloat16
save_every: 250
max_step_saves_to_keep: 10
datasets:
- folder_path: /workspace/ai-toolkit/dataset/pvc
caption_ext: txt
caption_dropout_rate: 0.01
shuffle_tokens: true
cache_latents_to_disk: true
resolution:
- 768
- 1024
train:
batch_size: 4
steps: 10000
gradient_accumulation_steps: 1
train_unet: true
train_text_encoder: false
gradient_checkpointing: true
noise_scheduler: flowmatch
optimizer: adamw8bit
lr: 0.0001
ema_config:
use_ema: true
ema_decay: 0.99
dtype: bf16
model:
name_or_path: black-forest-labs/FLUX.1-schnell
assistant_lora_path: ostris/FLUX.1-schnell-training-adapter
is_flux: true
quantize: true
sample:
sampler: flowmatch
sample_every: 250
width: 832
height: 1152
prompts:
- 1girl, solo, blue hair, cat ears, parted bangs, long hair, looking at viewer,
white dress shirt, rainy, wariza, sitting
- 1girl, solo, full body, gloves, groin, hair between eyes, hair ornament, head
tilt, holding, holding staff, horns, long hair, looking at viewer, maid headdress,
mole, mole under eye, navel, parted lips, purple eyes, purple hair, purple
theme, shawl, shorts, single horn, staff, standing, thighhighs, twintails,
white footwear, white gloves, white thighhighs
- 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, green
background, hat, hoop earrings, jewelry, looking at viewer, shirt, short hair,
solo, upper body, yellow shirt,
- 1girl, bangs, bare shoulders, beret, black hair, black shorts, blue hair,
bracelet, breasts, buttons, colored inner hair, double-breasted, eyewear removed,
green headwear, green jacket, grey eyes, grey sky, hat, jacket, jewelry, long
hair, looking at viewer, multicolored hair, neck ring, o-ring, off shoulder,
rain, round eyewear, shorts, sidelocks, small breasts, solo, sunglasses, wavy
hair, wet, zipper,
- 1girl, brown hair, green eyes, colorful, autumn, cumulonimbus clouds, lighting,
blue sky, falling leaves, garden
- 1girl, bangs, black wings, blush, choker, collarbone, feathered wings, hair
between eyes, halo, hand on hip, hat, head wings, long hair, long sleeves,
looking at viewer, pink eyes, pink hair, pleated skirt, school hat, school
uniform, serafuku, sidelocks, simple background, skirt, solo, twintails, wavy
mouth, white background, wings, waifu,
neg: ''
seed: 42
walk_seed: true
guidance_scale: 1
sample_steps: 4
logging:
use_wandb: true
project_name: flux_lora_pvc_schnell_1
run_name: run-1
meta:
name: flux_lora_schnell_pvc
version: '1.0'
📄 许可证
本项目采用Apache 2.0许可证。
Stable Diffusion V1 5
Openrail
稳定扩散是一种潜在的文本到图像扩散模型,能够根据任何文本输入生成逼真的图像。
图像生成
S
stable-diffusion-v1-5
3.7M
518
Stable Diffusion Inpainting
Openrail
基于稳定扩散的文本到图像生成模型,具备图像修复能力
图像生成
S
stable-diffusion-v1-5
3.3M
56
Stable Diffusion Xl Base 1.0
SDXL 1.0是基于扩散的文本生成图像模型,采用专家集成的潜在扩散流程,支持高分辨率图像生成
图像生成
S
stabilityai
2.4M
6,545
Stable Diffusion V1 4
Openrail
稳定扩散是一种潜在文本到图像扩散模型,能够根据任意文本输入生成逼真图像。
图像生成
S
CompVis
1.7M
6,778
Stable Diffusion Xl Refiner 1.0
SD-XL 1.0优化器模型是Stability AI开发的图像生成模型,专为提升SDXL基础模型生成的图像质量而设计,特别擅长最终去噪步骤处理。
图像生成
S
stabilityai
1.1M
1,882
Stable Diffusion 2 1
基于扩散的文本生成图像模型,支持通过文本提示生成和修改图像
图像生成
S
stabilityai
948.75k
3,966
Stable Diffusion Xl 1.0 Inpainting 0.1
基于Stable Diffusion XL的潜在文本到图像扩散模型,具备通过遮罩进行图像修复的功能
图像生成
S
diffusers
673.14k
334
Stable Diffusion 2 Base
基于扩散的文生图模型,可根据文本提示生成高质量图像
图像生成
S
stabilityai
613.60k
349
Playground V2.5 1024px Aesthetic
其他
开源文生图模型,能生成1024x1024分辨率及多种纵横比的美学图像,在美学质量上处于开源领域领先地位。
图像生成
P
playgroundai
554.94k
723
Sd Turbo
SD-Turbo是一款高速文本生成图像模型,仅需单次网络推理即可根据文本提示生成逼真图像。该模型作为研究原型发布,旨在探索小型蒸馏文本生成图像模型。
图像生成
S
stabilityai
502.82k
380
精选推荐AI模型
Llama 3 Typhoon V1.5x 8b Instruct
专为泰语设计的80亿参数指令模型,性能媲美GPT-3.5-turbo,优化了应用场景、检索增强生成、受限生成和推理任务
大型语言模型
Transformers 支持多种语言

L
scb10x
3,269
16
Cadet Tiny
Openrail
Cadet-Tiny是一个基于SODA数据集训练的超小型对话模型,专为边缘设备推理设计,体积仅为Cosmo-3B模型的2%左右。
对话系统
Transformers 英语

C
ToddGoldfarb
2,691
6
Roberta Base Chinese Extractive Qa
基于RoBERTa架构的中文抽取式问答模型,适用于从给定文本中提取答案的任务。
问答系统 中文
R
uer
2,694
98