Open-source reddy-v4 model: generate high-quality images of women for free

Reddy V4

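Developed by Unmapped2895

A standard PEFT LoRA model based on FLUX.1-dev, specialized in generating high-quality images of women

Image generation | Open source | License: Other | #WomensFashionPhotography #HighResolutionImageGeneration #LoRAFineTunedModel

Downloads: 59

Released: 4/4/2025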

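Model overview

This is a LoRA fine-tuned model based on FLUX.1-dev. It specializes in generating high-quality images of women across a variety of scenarios and supports both text-to-image and image-to-image tasks.

Model features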

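High-quality image generation

Generates detailed, high-quality images of women in a wide range of styles

LoRA fine-tuning

Fine-tunes the base model efficiently using low-rank adaptation (LoRA)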
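
To make the idea concrete, here is a minimal sketch of a low-rank update in plain Python. It illustrates the general LoRA technique, not this model's actual weights or training code. The 3072 x 3072 layer size is hypothetical, the rank of 16 is taken from the training settings on this page, and the `alpha` values used in the toy example are assumptions, since the card lists LoRA alpha as None.

```python
# Minimal LoRA sketch: a frozen weight W (d_out x d_in) is adapted as
#   W_eff = W + (alpha / rank) * B @ A
# where B is d_out x rank and A is rank x d_in. Only A and B are trained.

def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]

def lora_effective_weight(w, a, b, alpha, rank):
    """Return W + (alpha / rank) * B @ A."""
    scale = alpha / rank
    delta = matmul(b, a)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]

def lora_param_count(d_out, d_in, rank):
    """Trainable parameters in the adapter for one linear layer."""
    return rank * (d_out + d_in)

# Tiny worked example with rank 1 (hypothetical numbers).
w = [[1.0, 0.0], [0.0, 1.0]]
a = [[1.0, 2.0]]      # rank x d_in
b = [[0.5], [0.0]]    # d_out x rank
w_eff = lora_effective_weight(w, a, b, alpha=1, rank=1)

# Parameter savings for a hypothetical 3072 x 3072 layer at rank 16.
full_params = 3072 * 3072
adapter_params = lora_param_count(3072, 3072, 16)
print(w_eff, adapter_params / full_params)
```

In this hypothetical layer the adapter trains roughly 1% of the parameters a full fine-tune would touch, which is why LoRA files are small compared with the base model.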

Support for diverse scenarios

Supports image generation in many styles, including yoga, cyberpunk, and fantasy

Flow matching

Trained with flow matching to improve generation quality
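
As a rough illustration of what flow matching means, the sketch below uses the straight-line (rectified flow) formulation: a sample is interpolated between data and noise, the training target is the constant velocity along that line, and sampling integrates the velocity with Euler steps. It is a toy in plain Python with an oracle velocity in place of a trained network. The time convention (t = 1 is noise, t = 0 is data) and all function names are assumptions for illustration, not the training code used for this model.

```python
def interpolate(data, noise, t):
    """Point on the straight path: x_t = (1 - t) * data + t * noise."""
    return [(1 - t) * d + t * n for d, n in zip(data, noise)]

def velocity_target(data, noise):
    """Regression target for the network: dx_t/dt = noise - data."""
    return [n - d for d, n in zip(data, noise)]

def euler_sample(noise, velocity_fn, num_steps):
    """Integrate from t = 1 (noise) down to t = 0 (data) with Euler steps."""
    x = list(noise)
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = 1.0 - i * dt
        v = velocity_fn(x, t)
        x = [xi - dt * vi for xi, vi in zip(x, v)]
    return x

# Toy two-dimensional "image" and "noise" (hypothetical values).
data = [0.25, -0.5]
noise = [1.0, 2.0]
midpoint = interpolate(data, noise, 0.5)

# With the exact velocity, Euler integration walks from noise back to data.
v_true = velocity_target(data, noise)
recovered = euler_sample(noise, lambda x, t: v_true, num_steps=20)
print(midpoint, recovered)
```

In a real model the lambda is replaced by the network's prediction, and a scheduler such as the FlowMatchEulerDiscreteScheduler listed in the validation settings chooses the time steps.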

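Model capabilities

Text-to-image generation

Image-to-image generation

High-quality portrait generation

Image generation in diverse styles

Use cases

Creative design

Fashion photography generation

Generates lingerie model images in the style of high-end fashion photography

Can produce fashion images with professional photographic quality

Character design

Generates character images in various styles for games and film or video productions

Can produce characters in diverse styles such as cyberpunk and fantasy

Content creation

Social media content

Generates eye-catching influencer-style images for social media

Can produce high-quality content suited to platforms such as Instagram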

🚀 reddy-v4

This model is a standard PEFT LoRA based on black-forest-labs/FLUX.1-dev. It is used mainly for text-to-image generation.

🚀 Quick start

This section explains the basic usage of the model.

✨ Key features

Text-to-image generation is supported.
Image-to-image generation is also supported.
Content generation is described as safe, although no specific safety mechanism is documented.
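
The usage example on this page covers text-to-image only, so here is a hedged sketch of how image-to-image might look with the `FluxImg2ImgPipeline` class from Diffusers. The function name `run_img2img`, the `strength` default of 0.6, and the input path are hypothetical; the step count, guidance scale, seed, and resolution are copied from the validation settings. The helper `effective_denoising_steps` reflects the usual Diffusers img2img convention for `strength`, which is an assumption here and worth checking against the installed version.

```python
def effective_denoising_steps(num_inference_steps, strength):
    """Approximate number of steps actually run in img2img.

    Assumes the usual diffusers convention: strength 1.0 uses the whole
    schedule, lower values skip the earliest (noisiest) steps.
    """
    init_timestep = min(num_inference_steps * strength, num_inference_steps)
    t_start = int(max(num_inference_steps - init_timestep, 0))
    return num_inference_steps - t_start

def run_img2img(init_image_path, prompt, strength=0.6, seed=42):
    """Image-to-image with the LoRA applied (hypothetical helper).

    Needs the FLUX.1-dev weights and enough memory to hold them.
    """
    # Imported here so the helper above works without the heavy stack installed.
    import torch
    from diffusers import FluxImg2ImgPipeline
    from diffusers.utils import load_image

    device = ('cuda' if torch.cuda.is_available()
              else 'mps' if torch.backends.mps.is_available() else 'cpu')
    pipeline = FluxImg2ImgPipeline.from_pretrained(
        'black-forest-labs/FLUX.1-dev', torch_dtype=torch.bfloat16
    )
    pipeline.load_lora_weights('Unmapped2895/reddy-v4')
    pipeline.to(device)

    return pipeline(
        prompt=prompt,
        image=load_image(init_image_path),
        strength=strength,
        num_inference_steps=20,
        guidance_scale=3.5,
        width=832,
        height=1216,
        generator=torch.Generator(device=device).manual_seed(seed),
    ).images[0]

print(effective_denoising_steps(20, 0.6))
```

A call such as `run_img2img("input.png", "portrait photo, soft window light").save("output_img2img.png")` would then write the result to disk; lower `strength` values keep more of the input image.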

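📦 Installation

This model is used through the Hugging Face Diffusers library. The following command installs it together with the packages that the FLUX pipeline and LoRA loading typically need (transformers, accelerate, peft, sentencepiece, protobuf).

pip install diffusers transformers accelerate peft sentencepiece protobuf torch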

💻 Usage examples

Basic usage

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'Unmapped2895/reddy-v4'
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
pipeline.load_lora_weights(adapter_id)

prompt = "Realistic wide shot photo of woman posing in a luxurious satin lingerie set, featuring a plunging bra, delicate thong and a classic garter belt with black stockings. The satin lingerie shimmers softly in the light, and the cut emphasizes both sophistication and a hint of allure. The lingerie is detailed with fine lace edges, highlighting her alluring figure. She elegantly styled hair as if getting ready for a formal event. The photo has a cinematic quality with rays of light and dramatic play of shadow and light"


## Optional: quantise the model to save on vram.
## Note: The model was not quantised during training, so it is not necessary to quantise it during inference time.
#from optimum.quanto import quantize, freeze, qint8
#quantize(pipeline.transformer, weights=qint8)
#freeze(pipeline.transformer)
    
# Pick the best available device once; the pipeline is already in its target precision level
device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
pipeline.to(device)
model_output = pipeline(
    prompt=prompt,
    num_inference_steps=20,
    generator=torch.Generator(device=device).manual_seed(42),
    width=832,
    height=1216,
    guidance_scale=3.5,
).images[0]

model_output.save("output.png", format="PNG")

📚 Documentation

Validation settings

CFG: 3.5
CFG Rescale: 0.0
Steps: 20
Sampler: FlowMatchEulerDiscreteScheduler
Seed: 42
Resolution: 832x1216
Skip-layer guidance:

Note: The validation settings are not necessarily the same as the training settings.
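
The validation settings above map directly onto pipeline keyword arguments. The sketch below collects them in one dictionary and checks that the resolution is a multiple of 16. The names `VALIDATION_KWARGS`, `SEED`, and `check_resolution` are hypothetical, and the multiple of 16 is an assumption based on FLUX's 8x VAE downsampling combined with 2x2 latent packing.

```python
# Validation settings from this page, expressed as pipeline keyword arguments.
VALIDATION_KWARGS = {
    "guidance_scale": 3.5,       # CFG
    "num_inference_steps": 20,   # Steps
    "width": 832,                # Resolution 832x1216
    "height": 1216,
}
SEED = 42

def check_resolution(width, height, multiple=16):
    """Return True if both sides are divisible by `multiple`."""
    return width % multiple == 0 and height % multiple == 0

resolution_ok = check_resolution(
    VALIDATION_KWARGS["width"], VALIDATION_KWARGS["height"]
)
print(resolution_ok)
```

With a loaded pipeline, the call would then be `pipeline(prompt=prompt, generator=generator, **VALIDATION_KWARGS)`, where `generator` is seeded with `SEED`.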

Training settings

Setting	Value
Training epochs	10
Training steps	2000
Learning rate	0.0001
Learning rate schedule	constant
Warmup steps	500
Max gradient value	2.0
Effective batch size	1
Micro-batch size	1
Gradient accumulation steps	1
Number of GPUs	1
Gradient checkpointing	True
Prediction type	flow-matching (extra parameters=['shift=3', 'flux_guidance_mode=constant', 'flux_guidance_value=1.0', 'flow_matching_loss=compatible', 'flux_lora_target=all'])
Optimizer	adamw_bf16
Trainable parameter precision	Pure BF16
Base model precision	`no_change`
Caption dropout probability	10.0%
LoRA rank	16
LoRA alpha	None
LoRA dropout	0.1
LoRA initialization style	default
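
The batch-related rows of the table are linked by simple arithmetic, sketched below. The function names are hypothetical. The per-epoch figure is only an inference from the step and epoch counts: the card does not state the dataset size, and dataset repeats or resolution bucketing could change it.

```python
def effective_batch_size(micro_batch_size, grad_accum_steps, num_gpus):
    """Samples contributing to each optimizer step."""
    return micro_batch_size * grad_accum_steps * num_gpus

def samples_seen(train_steps, batch_size):
    """Total training samples processed over the run."""
    return train_steps * batch_size

# Values from the training settings table.
batch = effective_batch_size(micro_batch_size=1, grad_accum_steps=1, num_gpus=1)
total = samples_seen(train_steps=2000, batch_size=batch)

# Rough samples per epoch, assuming the 10 epochs divide the run evenly.
per_epoch = total / 10
print(batch, total, per_epoch)
```

This agrees with the table's effective batch size of 1 and suggests on the order of 200 samples per epoch, under the stated assumption.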