Flux.1-Dev-Realtime-Toon-Mixオープンソースモデル - 無料でカートゥーンミックススタイルの画像を生成（最終版ではない）

ホーム

Flux.1 Dev Realtime Toon Mix

prithivMLmodsによって開発

これはLoRAベースのテキストから画像への拡散モデルで、アニメ混合スタイルの画像生成に特化しています。モデルはまだトレーニング段階にあり、最終版ではありません。

画像生成オープンソースライセンス:Openrail #アニメ風LoRA #リアルタイム画像生成 #混合アートスタイル

ダウンロード数 76

リリース時間 : 11/20/2024

モデル概要

このモデルはblack-forest-labs/FLUX.1-devベースモデルを基に開発され、LoRA技術を通じてアニメ混合スタイルの画像生成を実現します。様々な人物やシーンのアニメスタイルレンダリングをサポートします。

モデル特徴

アニメ混合スタイル

ユニークなアニメ混合スタイルの画像を生成でき、アニメとリアルな要素を組み合わせます

LoRA微調整

LoRA技術に基づいてベース拡散モデルを微調整し、ベースモデルの能力を維持しながら特定のスタイルを追加します

多様なシーンサポート

様々なシーンや人物の組み合わせのアニメスタイル画像生成をサポートします

モデル能力

テキストから画像生成

アニメスタイルレンダリング

人物イメージ生成

シーン生成

使用事例

クリエイティブデザイン

キャラクターデザイン

ユニークなスタイルのアニメキャラクターイメージを生成

様々な髪型、服装、アクセサリーの組み合わせのキャラクターを生成可能

シーンデザイン

アニメスタイルの背景シーンを作成

自然景観、都市の通りなど様々な背景を生成可能

コンテンツ制作

イラスト制作

ストーリーや記事のためのイラストを生成

スタイルが統一されたシリーズイラストを提供

🚀 Flux.1-Dev-Realtime-Toon-Mix

このモデルは、テキストから画像を生成する能力を持ち、特に「toon mix」というスタイルの画像生成に特化しています。現在はまだ学習段階にあり、最終バージョンではないため、一部のケースでアーティファクトが含まれる可能性があり、性能が低い場合があります。

🚀 クイックスタート

このモデルを使用するには、以下の手順に従ってセットアップを行います。

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Realtime-Toon-Mix"
trigger_word = "toon mix"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

✨ 主な機能

テキストから画像生成：入力されたテキストに基づいて、「toon mix」スタイルの画像を生成します。
多様な画像生成：異なるテキスト入力により、様々なシーンや人物の画像を生成できます。

📦 インストール

このモデルを使用するには、上記のセットアップコードを実行する必要があります。必要なライブラリをインストールし、モデルをロードしてください。

💻 使用例

基本的な使用法

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Realtime-Toon-Mix"
trigger_word = "toon mix"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

# 画像生成の例
prompt = f"{trigger_word}, a young woman with long brown hair, wearing a white dress"
image = pipe(prompt).images[0]
image.save("generated_image.png")

高度な使用法

# 高度な画像生成の例
prompt = f"{trigger_word}, a beautiful landscape with a sunset, a lake, and a small house"
# 追加のパラメータを設定
num_inference_steps = 50
guidance_scale = 7.5
image = pipe(prompt, num_inference_steps=num_inference_steps, guidance_scale=guidance_scale).images[0]
image.save("advanced_generated_image.png")

📚 ドキュメント

モデルの説明

prithivMLmods/Flux.1-Dev-Realtime-Toon-Mix

画像処理パラメータ

パラメータ	値	パラメータ	値
LR Scheduler	constant	Noise Offset	0.03
Optimizer	AdamW	Multires Noise Discount	0.1
Network Dim	64	Multires Noise Iterations	10
Network Alpha	32	Repeat & Steps	17 & 2900
Epoch	15	Save Every N Epochs	1

ラベリング: florence2-en(自然言語 & 英語)

学習に使用された総画像数: 29

最適な解像度

768 x 1024 (最適)
1024 x 1024 (デフォルト)

トリガーワード

画像生成をトリガーするには、toon mix を使用する必要があります。

モデルのダウンロード

このモデルのウェイトはSafetensors形式で利用可能です。ダウンロードは、「Files & versions」タブから行えます。

ギャラリー

以下は、このモデルによって生成された画像の例です。

入力テキスト: 'toon mix, a medium-haired woman, dressed in a white shirt and dark-colored bikini, is adorned with a white scarf tied around her waist. She is holding a gray bottle, her right hand resting on her hip. Her left hand is wrapped around the bottle, adding a pop of color to her body. Shes wearing a pink sun visor, and her brown hair cascades over her shoulders. The backdrop, a rocky cliff, is a stark contrast to the blue sky, dotted with white stars.'
- 出力画像: 画像リンク
入力テキスト: 'toon mix, a vibrant orange hat stands atop a womans head. The womans long, brown hair cascades over her shoulders, framing her face. She is wearing a black jacket, adorned with a bright orange scarf, and a yellow backpack. The background is blurred, with a few cars parked on the street, adding a touch of color to the scene.'
- 出力画像: 画像リンク