Flux.1-Dev-Sketch-Card-LoRA開源模型 - 免費生成手持素描卡片風格圖像

首頁

Flux.1 Dev Sketch Card LoRA

由prithivMLmods開發

基於LoRA技術的文本生成圖像擴散模型，專注於生成手持素描卡片風格的圖像

圖像生成開源協議:Openrail #手繪卡通生成 #素描風格轉換 #LoRA微調擴散

下載量 25

發布時間 : 11/18/2024

模型概述

該模型是一個基於擴散模型的LoRA適配器，能夠根據文本描述生成具有素描卡片風格的手持卡片圖像。模型仍處於開發階段，可能在某些場景表現欠佳。

模型特點

素描卡片風格生成

專門優化用於生成手持素描卡片風格的圖像，卡片上繪製各種卡通形象

LoRA技術適配

採用LoRA技術對基礎擴散模型進行微調，實現特定風格的圖像生成

多元素場景構建

能夠生成包含卡片、人物、背景裝飾等多元素的複雜場景

模型能力

文本生成圖像

風格化圖像生成

卡通形象生成

場景構圖

使用案例

創意設計

卡通賀卡設計

根據描述生成各種卡通形象的素描賀卡

示例中展示了馬里奧、熊貓、小黃人等卡通形象的素描卡片

場景構圖

生成包含卡片、桌面佈置、背景裝飾的完整場景

示例中包含白色桌布、小彩燈等裝飾元素

🚀 Flux.1-Dev-Sketch-Card-LoRA

本項目是一個基於LoRA技術的文本到圖像生成模型，可根據輸入的文本描述生成相應的草圖卡片圖像。目前模型仍在訓練階段，後續會不斷優化。

🚀 快速開始

環境設置

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA"
trigger_word = "sketch card"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

觸發詞使用

你應該使用 sketch card 來觸發圖像生成。

模型下載

此模型的權重以Safetensors格式提供。點擊下載，可在“Files & versions” 標籤中找到。

✨ 主要特性

文本到圖像生成：根據輸入的文本描述生成對應的草圖卡片圖像。
LoRA技術：使用低秩自適應（LoRA）技術進行模型微調，提高生成效率。

📦 安裝指南

暫未提供具體安裝步驟，可參考上述快速開始部分的代碼示例進行環境設置。

💻 使用示例

基礎用法

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA"
trigger_word = "sketch card"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

# 示例文本輸入
text = 'sketch card, a close-up of a hand holding a card with a cartoon image of Mario on it. The card has a yellow background with a red cap and a red M on it, and the character is wearing blue overalls with a yellow button on the left side of his chest. The character is waving his left hand and has a big smile on his face. To the right of the card is a small cartoon character with a blue outfit and red hat. They are standing on a table with a white tablecloth. The table is adorned with small lights, adding a pop of color to the scene.'
image = pipe(text).images[0]
image.save("output.png")

📚 詳細文檔

模型描述

prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA

屬性	詳情
基礎模型	black-forest-labs/FLUX.1-dev
實例提示詞	sketch card
許可證	creativeml-openrail-m

圖像處理參數

參數	值	參數	值
學習率調度器	constant	噪聲偏移	0.03
優化器	AdamW	多分辨率噪聲折扣	0.1
網絡維度	64	多分辨率噪聲迭代次數	10
網絡阿爾法	32	重複次數與步數	14 & 1990
訓練輪數	16	每N輪保存一次	1