reddy-v2開源圖像生成模型 - 免費生成高質量女性形象圖，支持文圖互轉

首頁

Reddy V2

由Unmapped2895開發

基於FLUX.1-dev的低秩適配器(LoRA)模型，專注於生成高質量女性形象圖像，支持文本生成圖像和圖像生成圖像任務。

圖像生成開源協議:其他 #女性時尚寫真 #高精度內衣建模 #賽博朋克風格

下載量 26

發布時間 : 4/3/2025

模型概述

該模型是一個標準PEFT低秩適配器，專為生成寫實風格的女性形象設計，特別擅長表現服裝細節和光影效果。

模型特點

低秩適應

採用LoRA技術進行輕量級微調，保持基礎模型性能的同時實現特定風格適配

高質量圖像生成

特別擅長生成高細節的女性形象，能精確表現服裝材質和身體特徵

多場景適用

支持從日常到奇幻場景的多樣化圖像生成需求

模型能力

文本生成圖像

圖像生成圖像

寫實風格圖像生成

服裝細節表現

光影效果渲染

使用案例

時尚攝影

內衣廣告

生成展示內衣產品的高質量模特圖像

能精確表現緞面材質的光澤和蕾絲細節

角色設計

遊戲角色

創建具有特定服裝風格的遊戲女性角色

可生成賽博朋克風格或奇幻風格的角色形象

🚀 reddy-v2

reddy-v2是一個標準的PEFT LoRA，源自black-forest-labs/FLUX.1-dev，可用於文本到圖像的生成任務。

🚀 快速開始

推理示例

以下是使用reddy-v2進行推理的示例代碼：

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'Unmapped2895/reddy-v2'
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
pipeline.load_lora_weights(adapter_id)

prompt = "Realistic wide shot photo of woman posing in a luxurious satin lingerie set, featuring a plunging bra, delicate thong and a classic garter belt with black stockings. The satin lingerie shimmers softly in the light, and the cut emphasizes both sophistication and a hint of allure. The lingerie is detailed with fine lace edges, highlighting her alluring figure. She elegantly styled hair as if getting ready for a formal event. The photo has a cinematic quality with rays of light and dramatic play of shadow and light"


## Optional: quantise the model to save on vram.
## Note: The model was quantised during training, and so it is recommended to do the same during inference time.
from optimum.quanto import quantize, freeze, qint8
quantize(pipeline.transformer, weights=qint8)
freeze(pipeline.transformer)
    
pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    num_inference_steps=20,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=832,
    height=1216,
    guidance_scale=2.0,
).images[0]

model_output.save("output.png", format="PNG")

✨ 主要特性

基於black-forest-labs/FLUX.1-dev基礎模型，通過PEFT LoRA技術進行微調。
支持文本到圖像、圖像到圖像等多種生成任務。
訓練過程中使用特定的驗證提示和設置，確保生成圖像的質量。

📚 詳細文檔

驗證設置

CFG：2.0
CFG Rescale：0.0
Steps：20
Sampler：FlowMatchEulerDiscreteScheduler
Seed：42
Resolution：832x1216
Skip-layer guidance：無

注意：驗證設置不一定與訓練設置相同。

你可以在以下圖庫中找到一些示例圖像：

文本編碼器未進行訓練，推理時可複用基礎模型的文本編碼器。

訓練設置

屬性	詳情
訓練輪數	3
訓練步數	600
學習率	0.0001
學習率調度	常量
預熱步數	500
最大梯度值	2.0
有效批量大小	1
微批量大小	1
梯度累積步數	1
GPU數量	1
梯度檢查點	啟用
預測類型	flow - matching (額外參數=['shift=3', 'flux_guidance_mode=constant', 'flux_guidance_value=1.0', 'flow_matching_loss=compatible', 'flux_lora_target=all'])
優化器	adamw_bf16
可訓練參數精度	Pure BF16
基礎模型精度	`int8 - quanto`
字幕丟棄概率	0.0%
LoRA Rank	32
LoRA Alpha	無
LoRA Dropout	0.1
LoRA初始化風格	默認