🚀 Evt_V4-preview
EVT系列是一個在動畫風格模型上使用大型數據集進行微調的實驗項目。Evt_V4使用了比以往更大的數據集,其與ACertainty的餘弦相似度達到了85%。它的表現可能與其他模型不同,希望您能喜歡它。
🚀 快速開始
本模型的使用方法與其他Stable Diffusion模型相同。更多信息,請查看Stable Diffusion。
您還可以將模型導出為ONNX、MPS和/或FLAX/JAX格式。
💻 使用示例
基礎用法
from diffusers import StableDiffusionPipeline
import torch
model_id = "haor/Evt_V4-preview"
branch_name= "main"
pipe = StableDiffusionPipeline.from_pretrained(model_id, revision=branch_name, torch_dtype=torch.float16)
pipe = pipe.to("cuda")
prompt = "1girl"
image = pipe(prompt).images[0]
image.save("./1girl.png")
示例展示
提示詞1

1girl in black serafuku standing in a field solo, food, fruit, lemon, bubble, planet, moon, orange \(fruit\), lemon slice, leaf, fish, orange slice, by (tabi:1.25), spot color, looking at viewer, closeup cowboy shot
Negative prompt: (bad:0.81), (comic:0.81), (cropped:0.81), (error:0.81), (extra:0.81), (low:0.81), (lowres:0.81), (speech:0.81), (worst:0.81), (blush:0.9), 2koma, 3koma, 4koma, collage, lipstick
Steps: 20, Sampler: DPM++ SDE Karras, CFG scale: 7, Seed: 2285895007, Size: 512x1152, Denoising strength: 0.7, Clip skip: 2
提示詞2

{Masterpiece, Kaname_Madoka, tall and long double tails, well rooted hair, (pink hair), pink eyes, crossed bangs, ojousama, jk, thigh bandages, wrist cuffs, (pink bow: 1.2)}, plain color, sketch, masterpiece, high detail, masterpiece portrait, best quality, ray tracing, {:<, look at the edge}
Negative prompt: ((((ugly)))), (((duplicate))), ((morbid)), ((mutilated)),extra fingers, mutated hands, ((poorly drawn hands)), ((poorly drawn face)), (((bad proportions))), ((extra limbs)), (((deformed))), (((disfigured))), cloned face, gross proportions, (malformed limbs), ((missing arms)), ((missing legs)), (((extra arms))), (((extra legs))), too many fingers, (((long neck))), (((low quality))), normal quality, blurry, bad feet, text font ui, ((((worst quality)))), anatomical nonsense, (((bad shadow))), unnatural body, liquid body, 3D, 3D game, 3D game scene, 3D character, bad hairs, poorly drawn hairs, fused hairs, big muscles, bad face, extra eyes, furry, pony, mosaic, disappearing calf, disappearing legs, extra digit, fewer digit, fused digit, missing digit, fused feet, poorly drawn eyes, big face, long face, bad eyes, thick lips, obesity, strong girl, beard,Excess legs
Steps: 20, Sampler: DPM++ SDE Karras, CFG scale: 7, Seed: 2468255263, Size: 512x1152, Denoising strength: 0.7, Clip skip: 2
🔧 技術細節
訓練信息
arb配置
arb:
enabled: true
debug: false
base_res: [512, 512]
max_size: [768, 512]
divisible: 64
max_ar_error: 4
min_dim: 256
dim_limit: 1024
調度器和優化器配置
scheduler:
name: diffusers.DDIMScheduler
params:
beta_end: 0.012
beta_schedule: "scaled_linear"
beta_start: 0.00085
clip_sample: false
num_train_timesteps: 1000
set_alpha_to_one: false
steps_offset: 1
trained_betas: null
optimizer:
name: bitsandbytes.optim.AdamW8bit
params:
lr: 2e-6
weight_decay: 5e-2
eps: 1e-7
lr_scheduler:
name: torch.optim.lr_scheduler.CosineAnnealingWarmRestarts
warmup:
enabled: true
init_lr: 2e-8
num_warmup: 50
strategy: "cos"
params:
T_0: 5
T_mult: 1
eta_min: 6e-7
last_epoch: -1
訓練資源消耗
大約花費了300個V100 GPU小時。
📄 許可證
本模型開放訪問,所有人均可使用,並遵循CreativeML OpenRAIL - M許可證,該許可證進一步規定了權利和使用方式。
CreativeML OpenRAIL許可證規定:
- 您不得使用該模型故意生成或分享非法或有害的輸出或內容。
- 作者對您生成的輸出不主張任何權利,您可以自由使用它們,但需對其使用負責,且使用不得違反許可證中的規定。
- 您可以重新分發模型權重,並將模型用於商業用途和/或作為服務使用。如果這樣做,請務必包含與許可證中相同的使用限制,並向所有用戶提供一份CreativeML OpenRAIL - M許可證副本(請完整、仔細閱讀許可證)。
請在此閱讀完整許可證