Flux.1-Dev-Hand-Sticky-LoRA开源模型 - 可生成手部握持贴纸图像，训练中别错过！

首页

Flux.1 Dev Hand Sticky LoRA

由 prithivMLmods 开发

这是一个基于FLUX.1-dev的LoRA模型，专注于生成手部握持贴纸的图像，目前仍处于训练阶段。

图像生成开源协议:Openrail #手部贴纸生成 #LoRA微调扩散模型 #激励标语设计

下载量 33

发布时间 : 11/17/2024

模型简介

该模型用于生成手部握持贴纸的高质量图像，支持多种贴纸设计和背景组合。

模型特点

手部贴纸特化

专门针对手部握持贴纸的场景进行优化

多元素组合

支持贴纸、手部、背景等多种元素的自然组合

高分辨率输出

支持768x1024和1024x1024的高分辨率图像生成

模型能力

文本生成图像

手部姿势生成

贴纸设计生成

背景合成

使用案例

创意设计

励志贴纸设计

生成带有励志文字的手持贴纸图像

如示例中的'你可以做到！'贴纸

趣味贴纸设计

生成带有趣味图案的手持贴纸图像

如示例中的笑脸冰淇淋贴纸

社交媒体内容

社交媒体贴纸素材

为社交媒体创作独特的手持贴纸内容

🚀 Flux.1-Dev-Hand-Sticky-LoRA

本模型是一个文本到图像的模型，借助LoRA技术，可生成与手持贴纸相关的图像，为图像创作提供新的可能。

🚀 快速开始

模型设置

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Hand-Sticky-LoRA"
trigger_word = "handstick69"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

触发图像生成

你应该使用 handstick69 来触发图像生成。

下载模型

此模型的权重以Safetensors格式提供。点击下载，可在“文件与版本”选项卡中获取。

✨ 主要特性

文本到图像转换：根据输入的文本描述，生成与之对应的手持贴纸图像。
LoRA技术：利用低秩自适应（LoRA）技术，提高模型训练效率和性能。

📦 安装指南

暂未提供具体安装步骤，可参考上述快速开始部分的代码示例进行模型设置。

💻 使用示例

基础用法

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Hand-Sticky-LoRA"
trigger_word = "handstick69"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

# 示例文本
text = f'{trigger_word}, a human hand is holding two small stickers, each with the words "you can do this!" written on them in black text. The left sticker is pink, while the right sticker is yellow, with black text written on it. Behind the hand, there is a plant with green leaves and a white tile floor.'
image = pipe(text).images[0]
image.show()

📚 详细文档

模型描述

prithivMLmods/Flux.1-Dev-Hand-Sticky-LoRA

图像处理参数

参数	值	参数	值
LR调度器	constant	噪声偏移	0.03
优化器	AdamW	多分辨率噪声折扣	0.1
网络维度	64	多分辨率噪声迭代次数	10
网络Alpha	32	重复次数与步数	17 & 1920
轮数	10	每N轮保存一次	1