Flux.1-Dev-Sketch-Card-LoRA Open-Source Model - Generate Hand-Held Sketch Card Style Images for Free

Flux.1 Dev Sketch Card LoRA

Developed by prithivMLmods

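A LoRA-based text-to-image diffusion model focused on generating images of hand-held sketch cards

Image Generation Open Source License: OpenRAIL #Hand-drawn cartoon generation #Sketch style transfer #LoRA fine-tuned diffusion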

Downloads 25

Release Time : 11/18/2024

Model Overview

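This model is a LoRA adapter for a diffusion model. Given a text description, it generates images of a hand holding a card drawn in a sketch style. The model is still under development and may perform poorly in some scenarios.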

Model Features

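Sketch card style generation

Optimized specifically for images of hand-held sketch cards, with various cartoon characters drawn on the cards

LoRA adaptation

Fine-tunes the base diffusion model with LoRA to produce images in this specific style

Multi-element scene composition

Can generate complex scenes that combine cards, characters, and background decorations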

Model Capabilities

Text-to-image generation

Stylized image generation

Cartoon character generation

Scene composition

Use Cases

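Creative design

Cartoon greeting card design

Generates sketch-style greeting cards featuring cartoon characters from a text description

The examples show sketch cards of cartoon characters such as Mario, a panda, and the Minions

Scene composition

Generates complete scenes that include the card, the table setting, and background decorations

The examples include decorative elements such as a white tablecloth and small string lights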

🚀 Flux.1-Dev-Sketch-Card-LoRA

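This project is a LoRA-based text-to-image model that generates sketch card images from a text description. The model is still in the training phase and will continue to be improved.

🚀 Quick Start

Environment Setup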

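import torch
from diffusers import DiffusionPipeline

# Load the FLUX.1-dev base model in bfloat16
base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

# Attach the Sketch Card LoRA weights to the base pipeline
lora_repo = "prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA"
trigger_word = "sketch card"  # include this phrase in every prompt
pipe.load_lora_weights(lora_repo)

# Move the pipeline to the GPU (requires a CUDA device)
device = torch.device("cuda")
pipe.to(device)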

Trigger Word

Include the phrase sketch card in your prompt to trigger the sketch card style.
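One way to make sure the trigger phrase is never forgotten is to add it to every prompt automatically. The sketch below is only an illustration: the helper name `with_trigger` is hypothetical, and placing the phrase at the start of the prompt is an assumption based on the example prompt on this page, not a documented requirement.

```python
TRIGGER_WORD = "sketch card"  # trigger phrase stated on this model card


def with_trigger(prompt: str, trigger: str = TRIGGER_WORD) -> str:
    """Return the prompt with the trigger phrase in front, added only if missing."""
    cleaned = prompt.strip()
    # Case-insensitive check so "Sketch card, ..." is not prefixed a second time
    if cleaned.lower().startswith(trigger.lower()):
        return cleaned
    return f"{trigger}, {cleaned}"


print(with_trigger("a hand holding a card with a cartoon panda on it"))
print(with_trigger("sketch card, a hand holding a card with a Minion on it"))
```

The returned string can be passed to the pipeline in place of a hand-written prompt.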

Model Download

The weights for this model are provided in Safetensors format. They can be downloaded from the "Files & versions" tab.

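✨ Key Features

Text-to-image generation: generates a sketch card image that matches the input text description.
LoRA: the base model is fine-tuned with Low-Rank Adaptation (LoRA), which trains only small adapter matrices and so keeps fine-tuning efficient.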
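To give a rough sense of why LoRA is lightweight, the sketch below compares the number of trainable values in a full weight matrix with the number in a rank-r LoRA update, which factors the update into two thin matrices of shapes (r, d_in) and (d_out, r). The layer width of 3072 is a hypothetical figure chosen for illustration and is not taken from this page; the rank of 64 matches the network dimension listed in the training parameters.

```python
def full_param_count(d_in: int, d_out: int) -> int:
    """Trainable values when the whole d_out x d_in weight matrix is updated."""
    return d_in * d_out


def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Trainable values for a LoRA update: A is (rank, d_in), B is (d_out, rank)."""
    return rank * (d_in + d_out)


d_in = d_out = 3072  # hypothetical layer width, for illustration only
rank = 64            # network dimension from the training parameters

full = full_param_count(d_in, d_out)
lora = lora_param_count(d_in, d_out, rank)
print(f"full: {full}, LoRA: {lora}, ratio: {lora / full:.2%}")
```

Under these assumptions the adapter trains only a few percent of the values that full fine-tuning of the same layer would touch.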

📦 Installation

No dedicated installation steps are provided yet. Refer to the code in the Quick Start section above for environment setup.

💻 Usage Examples

Basic Usage

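import torch
from diffusers import DiffusionPipeline

# Load the FLUX.1-dev base model in bfloat16
base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

# Attach the Sketch Card LoRA weights to the base pipeline
lora_repo = "prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA"
trigger_word = "sketch card"  # include this phrase in every prompt
pipe.load_lora_weights(lora_repo)

# Move the pipeline to the GPU (requires a CUDA device)
device = torch.device("cuda")
pipe.to(device)

# Example prompt; it starts with the trigger phrase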
text = 'sketch card, a close-up of a hand holding a card with a cartoon image of Mario on it. The card has a yellow background with a red cap and a red M on it, and the character is wearing blue overalls with a yellow button on the left side of his chest. The character is waving his left hand and has a big smile on his face. To the right of the card is a small cartoon character with a blue outfit and red hat. They are standing on a table with a white tablecloth. The table is adorned with small lights, adding a pop of color to the scene.'
image = pipe(text).images[0]
image.save("output.png")
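The call above relies on the pipeline's default settings. If you want to set image size and sampling options explicitly, one possible approach is to collect them in a dictionary first. Everything in this sketch is an assumption rather than a recommendation from the model author: the helper name `generation_kwargs` is hypothetical, the default values are illustrative, and rounding dimensions down to a multiple of 16 reflects my understanding of what the FLUX pipeline in diffusers expects.

```python
def generation_kwargs(prompt, width=1024, height=1024, steps=28, guidance_scale=3.5):
    """Collect keyword arguments for a FLUX pipeline call (illustrative defaults)."""

    def snap(value, multiple=16):
        # Round down to the nearest multiple, but never below one multiple
        return max(multiple, (value // multiple) * multiple)

    return {
        "prompt": prompt,
        "width": snap(width),
        "height": snap(height),
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


kwargs = generation_kwargs(
    "sketch card, a hand holding a card with a cartoon panda on it",
    width=1000,
    height=770,
)
print(kwargs)

# With the pipeline from the example above already loaded:
# image = pipe(**kwargs).images[0]
# image.save("output.png")
```

Keeping the settings in one dictionary makes it easy to log or reuse them across several prompts.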

📚 Detailed Documentation

Model Description

prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA

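Property	Details
Base model	black-forest-labs/FLUX.1-dev
Instance prompt	sketch card
License	creativeml-openrail-m

Image Processing Parameters

Parameter	Value	Parameter	Value
LR scheduler	constant	Noise offset	0.03
Optimizer	AdamW	Multires noise discount	0.1
Network dim	64	Multires noise iterations	10
Network alpha	32	Repeats & steps	14 & 1990
Epochs	16	Save every N epochs	1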
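The network dimension and network alpha in the table interact: in the standard LoRA formulation the low-rank update is multiplied by alpha divided by rank before it is added to the base weights. The sketch below records the table values in a plain dictionary (the key names are my own, not a trainer's configuration schema) and computes that scale, assuming the trainer follows the standard formulation.

```python
# Values copied from the parameter table; key names are illustrative
training_params = {
    "lr_scheduler": "constant",
    "optimizer": "AdamW",
    "network_dim": 64,
    "network_alpha": 32,
    "epochs": 16,
    "noise_offset": 0.03,
    "multires_noise_discount": 0.1,
    "multires_noise_iterations": 10,
    "repeats": 14,
    "steps": 1990,
    "save_every_n_epochs": 1,
}

# Standard LoRA scaling: the update B @ A is multiplied by alpha / rank
lora_scale = training_params["network_alpha"] / training_params["network_dim"]
print(f"LoRA scale (alpha / dim): {lora_scale}")
```

With alpha set to half the rank, the learned update is applied at half strength relative to a configuration where alpha equals the rank.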