Flux.1-Dev-Hand-Sticky-LoRA open-source model: generates images of hands holding stickers (still in training)

Flux.1 Dev Hand Sticky LoRA

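Developed by prithivMLmods

This is a LoRA model based on FLUX.1-dev that focuses on generating images of hands holding stickers. It is still in training.

Category: Image generation · License: OpenRAIL · Tags: #hand-sticker-generation #LoRA-fine-tuned-diffusion-model #motivational-slogan-design

Downloads: 33

Released: 11/17/2024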

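Model Overview

The model generates high-quality images of hands holding stickers and supports a variety of sticker designs and background combinations.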

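Model Features

Hand-sticker specialization

Optimized specifically for scenes in which a hand holds a sticker

Multi-element composition

Combines stickers, hands, backgrounds, and other elements naturally

High-resolution output

Supports high-resolution image generation at 768x1024 and 1024x1024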

Model Capabilities

Text-to-image generation

Hand pose generation

Sticker design generation

Background compositing

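Use Cases

Creative design

Motivational sticker design

Generate images of hand-held stickers with motivational text

For example, the "you can do this!" sticker in the examples

Fun sticker design

Generate images of hand-held stickers with playful artwork

For example, the smiling ice cream sticker in the examples

Social media content

Social media sticker assets

Create distinctive hand-held sticker content for social media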

🚀 Flux.1-Dev-Hand-Sticky-LoRA

This is a text-to-image model that uses LoRA to generate images of hand-held stickers.

🚀 Quick Start

Model setup

import torch
from diffusers import DiffusionPipeline

# Load the FLUX.1-dev base pipeline in bfloat16
base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

# Attach the LoRA weights; prompts should include the trigger word
lora_repo = "prithivMLmods/Flux.1-Dev-Hand-Sticky-LoRA"
trigger_word = "handstick69"
pipe.load_lora_weights(lora_repo)

# Move the pipeline to the GPU
device = torch.device("cuda")
pipe.to(device)

Triggering image generation

Include handstick69 in the prompt to trigger image generation.
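
As a minimal sketch of how the trigger word can be applied consistently, the helper below prefixes a scene description with it. The function name `build_prompt` and the "trigger word first, then a comma" layout are illustrative assumptions; the source only states that the trigger word should be used.

```python
TRIGGER_WORD = "handstick69"  # trigger word stated in this document


def build_prompt(description: str, trigger_word: str = TRIGGER_WORD) -> str:
    """Prefix a scene description with the trigger word (hypothetical helper).

    If the description already contains the trigger word, it is returned
    unchanged (apart from stripping surrounding whitespace).
    """
    description = description.strip()
    if trigger_word.lower() in description.lower():
        return description
    return f"{trigger_word}, {description}"


if __name__ == "__main__":
    print(build_prompt("a hand holding a round sticker of a smiling ice cream cone"))
```

The resulting string can be passed to the pipeline as the prompt.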

Downloading the model

The weights for this model are provided in Safetensors format. They can be downloaded from the "Files and versions" tab.

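✨ Key Features

Text-to-image: generates a hand-held sticker image that matches the input text description.
LoRA: uses low-rank adaptation (LoRA), which adapts the base model through small low-rank weight updates and so improves training efficiency.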
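
To make the efficiency claim concrete, here is a rough sketch comparing trainable parameter counts for one linear layer. It assumes the standard LoRA formulation, in which a `d_out x d_in` weight update is factored into two matrices of rank `r`. The layer size 3072 is an illustrative assumption, and 64 is the network dimension listed in this document's parameter table.

```python
def lora_param_counts(d_in: int, d_out: int, rank: int) -> tuple[int, int]:
    """Return (full, lora) trainable parameter counts for one linear layer.

    full: updating the whole d_out x d_in weight matrix.
    lora: updating two factors, A (rank x d_in) and B (d_out x rank).
    """
    full = d_in * d_out
    lora = rank * (d_in + d_out)
    return full, lora


if __name__ == "__main__":
    # 3072 is an assumed, illustrative layer width; rank 64 comes from the table.
    full, lora = lora_param_counts(d_in=3072, d_out=3072, rank=64)
    print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.2%}")
```

Under these assumptions the low-rank factors hold only a small fraction of the parameters of the full matrix, which is where the training savings come from.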

📦 Installation

No specific installation steps are provided. Refer to the code in the Quick Start section above to set up the model.
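
Since no installation steps are given, the following is a hedged sketch for checking an environment before running the snippets. `torch` is imported by the snippets in this document and `DiffusionPipeline` is a class from `diffusers`; the remaining package names are assumptions about what a FLUX pipeline with LoRA loading typically needs, not requirements stated by the source.

```python
import importlib.util

# torch and diffusers follow from the document's snippets; the rest are
# assumptions (text encoders, model loading, LoRA support, tokenizer).
ASSUMED_PACKAGES = [
    "torch",
    "diffusers",
    "transformers",
    "accelerate",
    "peft",
    "sentencepiece",
]


def missing_packages(names):
    """Return the top-level package names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(ASSUMED_PACKAGES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All assumed packages are installed.")
```

Any package reported as missing can be installed with the environment's usual package manager.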

💻 Usage Example

Basic usage

import torch
from diffusers import DiffusionPipeline

# Load the FLUX.1-dev base pipeline in bfloat16
base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

# Attach the LoRA weights
lora_repo = "prithivMLmods/Flux.1-Dev-Hand-Sticky-LoRA"
trigger_word = "handstick69"
pipe.load_lora_weights(lora_repo)

# Move the pipeline to the GPU
device = torch.device("cuda")
pipe.to(device)

# Example prompt, starting with the trigger word
text = f'{trigger_word}, a human hand is holding two small stickers, each with the words "you can do this!" written on them in black text. The left sticker is pink, while the right sticker is yellow, with black text written on it. Behind the hand, there is a plant with green leaves and a white tile floor.'
image = pipe(text).images[0]
image.show()
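
The basic usage above relies on the pipeline's default output size. As a sketch of requesting one of the resolutions listed in the model features, the hypothetical helper below builds the `width` and `height` keyword arguments accepted by the pipeline call. Reading "768x1024" as width x height is an assumption; the source does not state the order.

```python
# Sizes listed in this document; the (width, height) order is an assumption.
SUPPORTED_SIZES = {
    "portrait": (768, 1024),
    "square": (1024, 1024),
}


def size_kwargs(name: str) -> dict:
    """Return width/height keyword arguments for a named size (hypothetical helper)."""
    width, height = SUPPORTED_SIZES[name]
    return {"width": width, "height": height}


# Usage with the pipeline and prompt from the example above
# (not executed here, since it needs a GPU and the model weights):
# image = pipe(text, **size_kwargs("portrait")).images[0]
```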

📚 Detailed Documentation

Model description

prithivMLmods/Flux.1-Dev-Hand-Sticky-LoRA

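Image processing parameters

Parameter	Value	Parameter	Value
LR Scheduler	constant	Noise Offset	0.03
Optimizer	AdamW	Multires Noise Discount	0.1
Network Dim	64	Multires Noise Iterations	10
Network Alpha	32	Repeat & Steps	17 & 1920
Epoch	10	Save Every N Epochs	1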
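
In the common LoRA formulation, the low-rank update is multiplied by alpha divided by the rank before being added to the base weight. Assuming the trainer used here follows that convention (the source does not say so), the network dimension and network alpha in the table imply the scale computed below. The function name is illustrative.

```python
def lora_scale(network_alpha: float, network_dim: int) -> float:
    """Scale applied to the low-rank update under the alpha / rank convention (assumed)."""
    return network_alpha / network_dim


if __name__ == "__main__":
    # Network Alpha = 32 and Network Dim = 64, as listed in the table above.
    print(lora_scale(32, 64))  # 0.5
```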