開源reddy-v4模型 - 免費生成高質量女性形象圖像

首頁

Reddy V4

由Unmapped2895開發

基於FLUX.1-dev的標準PEFT LoRA模型，專注於生成高質量女性形象圖像

圖像生成開源協議:其他 #女性時尚寫真 #高分辨率圖像生成 #LoRA微調模型

下載量 59

發布時間 : 4/4/2025

模型概述

這是一個基於FLUX.1-dev的LoRA微調模型，專門用於生成各種場景下的高質量女性形象圖像，支持文本生成圖像和圖像生成圖像任務。

模型特點

高質量圖像生成

能夠生成細節豐富、風格多樣的高質量女性形象圖像

LoRA微調

採用低秩適應(LoRA)技術對基礎模型進行高效微調

多場景支持

支持多種場景下的圖像生成，包括瑜伽、賽博朋克、奇幻等不同風格

流匹配技術

採用流匹配(Flow Matching)技術進行訓練，提升生成質量

模型能力

文本生成圖像

圖像生成圖像

高質量人物形象生成

多風格圖像生成

使用案例

創意設計

時尚攝影生成

生成高端時尚攝影風格的內衣模特圖像

可生成具有專業攝影質感的時尚圖像

角色設計

為遊戲或影視作品生成各種風格的角色形象

可生成賽博朋克、奇幻等多種風格的角色形象

內容創作

社交媒體內容

為社交媒體生成吸引眼球的網紅風格圖像

可生成適合Instagram等平臺的高質量內容

🚀 reddy-v4

reddy-v4 是一個基於 black-forest-labs/FLUX.1-dev 的標準 PEFT LoRA 模型，可用於文本到圖像的生成任務。

🚀 快速開始

推理示例

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'Unmapped2895/reddy-v4'
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
pipeline.load_lora_weights(adapter_id)

prompt = "Realistic wide shot photo of woman posing in a luxurious satin lingerie set, featuring a plunging bra, delicate thong and a classic garter belt with black stockings. The satin lingerie shimmers softly in the light, and the cut emphasizes both sophistication and a hint of allure. The lingerie is detailed with fine lace edges, highlighting her alluring figure. She elegantly styled hair as if getting ready for a formal event. The photo has a cinematic quality with rays of light and dramatic play of shadow and light"


## Optional: quantise the model to save on vram.
## Note: The model was not quantised during training, so it is not necessary to quantise it during inference time.
#from optimum.quanto import quantize, freeze, qint8
#quantize(pipeline.transformer, weights=qint8)
#freeze(pipeline.transformer)
    
pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    num_inference_steps=20,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=832,
    height=1216,
    guidance_scale=3.5,
).images[0]

model_output.save("output.png", format="PNG")

✨ 主要特性

基於 black-forest-labs/FLUX.1-dev 基礎模型構建，屬於標準 PEFT LoRA 模型。
支持文本到圖像、圖像到圖像等多種生成任務。
訓練和推理過程中提供了詳細的參數設置。

📦 安裝指南

文檔未提及具體安裝步驟，暫不提供。

💻 使用示例

基礎用法

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'Unmapped2895/reddy-v4'
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
pipeline.load_lora_weights(adapter_id)

prompt = "Realistic wide shot photo of woman posing in a luxurious satin lingerie set, featuring a plunging bra, delicate thong and a classic garter belt with black stockings. The satin lingerie shimmers softly in the light, and the cut emphasizes both sophistication and a hint of allure. The lingerie is detailed with fine lace edges, highlighting her alluring figure. She elegantly styled hair as if getting ready for a formal event. The photo has a cinematic quality with rays of light and dramatic play of shadow and light"

pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    num_inference_steps=20,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=832,
    height=1216,
    guidance_scale=3.5,
).images[0]

model_output.save("output.png", format="PNG")

高級用法

文檔未提及高級用法示例，暫不提供。

📚 詳細文檔

驗證設置

CFG：3.5
CFG Rescale：0.0
步數：20
採樣器：FlowMatchEulerDiscreteScheduler
隨機種子：42
分辨率：832x1216
跳過層引導：無

注意：驗證設置不一定與訓練設置相同。

訓練設置

屬性	詳情
訓練輪數	10
訓練步數	2000
學習率	0.0001
學習率調度	常量
熱身步數	500
最大梯度值	2.0
有效批量大小	1
微批量大小	1
梯度累積步數	1
GPU 數量	1
梯度檢查點	啟用
預測類型	flow - matching (額外參數=['shift=3', 'flux_guidance_mode=constant', 'flux_guidance_value=1.0', 'flow_matching_loss=compatible', 'flux_lora_target=all'])
優化器	adamw_bf16
可訓練參數精度	Pure BF16
基礎模型精度	`no_change`
字幕丟棄概率	10.0%
LoRA 秩	16
LoRA Alpha	無
LoRA 丟棄率	0.1
LoRA 初始化風格	默認