Elizabeth Olsen Sdxl Flux
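Model Overview
This LoRA adapter generates portraits modeled on Elizabeth Olsen with text-to-image diffusion pipelines; the card lists black-forest-labs/FLUX.1-dev as the base model. Covered scenarios include ID photos, art portraits, and Marvel character cosplay, with fine-grained control over visual details.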
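Model Features
Accurate celebrity likeness
Specifically tuned for Elizabeth Olsen's facial features and for the details of Marvel's Scarlet Witch look.
Multi-scenario support
Covers a range of image generation needs, from formal ID photos to fantasy cosplay.
FLUX.1-dev optimization
Built on the black-forest-labs/FLUX.1-dev base model for stronger photorealism.
Fine-grained control
Detailed text descriptions give precise control over visual elements such as clothing, accessories, and expression.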
Model Capabilities
High-quality portrait generation
Cosplay image synthesis
Multi-style image conversion
Detailed feature control
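Use Cases
ID photo generation
Professional passport photos
Generates formal ID-style photos with customizable hairstyle, clothing, and accessories.
Examples: 32784958.jpeg and others.
Artistic creation
Marvel cosplay
Generates Scarlet Witch themed fantasy portraits, including elements such as chaos magic effects.
Examples: example_w6n0petr4.png and others.
Historical scene recreation
Places the subject in historical settings such as ancient Rome and generates portraits that fit the period.
Example: 32702487.jpeg.
Commercial applications
Promotional material generation
Quickly generates promotional images that contain specific text or brand elements.
Example: 32702761.jpeg.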
🚀 Elizabeth Olsen (SDXL+FLUX)
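This is a text-to-image model that generates high-quality images themed on Elizabeth Olsen, and it performs especially well when rendering Marvel Comics' Scarlet Witch.
🚀 Quick Start
Model Download
The weights for this model are provided in Safetensors format. You can download them from the Files & versions tab.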
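As an alternative to the web tab, the download can be sketched in Python. This is a minimal sketch, assuming the `huggingface_hub` package is installed and that the repository and file names match the ones used in the snippets on this page; the helper names `is_safetensors` and `download_weights` are hypothetical and not part of any library.

```python
from pathlib import Path

# Repository and weight file names as they appear in the usage snippets on this page.
REPO_ID = "Keltezaa/elizabeth-olsen-sdxl-flux"
WEIGHT_NAME = "eliolsen_2017_local_164_merger_20v1_8v2_34v2_03_03_04.safetensors"


def is_safetensors(filename: str) -> bool:
    """Return True if the file name carries the .safetensors extension."""
    return Path(filename).suffix.lower() == ".safetensors"


def download_weights(repo_id: str = REPO_ID, filename: str = WEIGHT_NAME) -> str:
    """Download one weight file and return its local cache path (hypothetical helper)."""
    if not is_safetensors(filename):
        raise ValueError(f"expected a .safetensors file, got: {filename}")
    # Imported lazily so the helpers above work without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=filename)
```

Calling `download_weights()` returns the path of the cached file, which can then be inspected or passed to other tools.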
Using the 🧨 diffusers library
from diffusers import AutoPipelineForText2Image
import torch
device = "cuda" if torch.cuda.is_available() else "cpu"
pipeline = AutoPipelineForText2Image.from_pretrained('black-forest-labs/FLUX.1-dev', torch_dtype=torch.bfloat16).to(device)
pipeline.load_lora_weights('Keltezaa/elizabeth-olsen-sdxl-flux', weight_name='eliolsen_2017_local_164_merger_20v1_8v2_34v2_03_03_04.safetensors')
image = pipeline('High quality passport photo of a woman with wavy blonde hair wearing a suit and tie looking directly at the camera with her mouth closed and a neutral expression. She is also wearing a delicate gold chain and some understated diamond earrings.').images[0]
For more details, including weighting, merging, and fusing LoRAs, see the diffusers documentation on loading LoRAs.
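LoRA weighting can be sketched as follows. This is an illustrative sketch rather than the author's method: it assumes a recent diffusers release with the PEFT backend installed, so that `load_lora_weights` accepts `adapter_name` and `set_adapters` is available on the pipeline. The adapter label `olsen_lora`, the helper names, and the 0.0 to 1.5 clamp range are assumptions chosen for illustration.

```python
ADAPTER_NAME = "olsen_lora"  # hypothetical label, any string works


def clamp_scale(scale: float, low: float = 0.0, high: float = 1.5) -> float:
    """Keep the LoRA weight inside an assumed sensible range."""
    return max(low, min(high, float(scale)))


def load_pipeline_with_lora(scale: float = 0.8):
    """Load FLUX.1-dev, attach the LoRA under a name, and set its weight."""
    # Heavy imports are kept inside the function so clamp_scale stays importable alone.
    import torch
    from diffusers import AutoPipelineForText2Image

    device = "cuda" if torch.cuda.is_available() else "cpu"
    pipeline = AutoPipelineForText2Image.from_pretrained(
        "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
    ).to(device)
    pipeline.load_lora_weights(
        "Keltezaa/elizabeth-olsen-sdxl-flux",
        weight_name="eliolsen_2017_local_164_merger_20v1_8v2_34v2_03_03_04.safetensors",
        adapter_name=ADAPTER_NAME,
    )
    # A lower weight weakens the adapter's influence; a higher one strengthens it.
    pipeline.set_adapters([ADAPTER_NAME], adapter_weights=[clamp_scale(scale)])
    return pipeline
```

Naming the adapter makes it possible to load further LoRAs later and weight each one separately through the same `set_adapters` call.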
💻 Usage Examples
Basic Usage
from diffusers import AutoPipelineForText2Image
import torch
device = "cuda" if torch.cuda.is_available() else "cpu"
pipeline = AutoPipelineForText2Image.from_pretrained('black-forest-labs/FLUX.1-dev', torch_dtype=torch.bfloat16).to(device)
pipeline.load_lora_weights('Keltezaa/elizabeth-olsen-sdxl-flux', weight_name='eliolsen_2017_local_164_merger_20v1_8v2_34v2_03_03_04.safetensors')
image = pipeline('High quality passport photo of a woman with wavy blonde hair wearing a suit and tie looking directly at the camera with her mouth closed and a neutral expression. She is also wearing a delicate gold chain and some understated diamond earrings.').images[0]
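Advanced Usage
# Code and notes for specific advanced scenarios can go here,
# for example adjusting parameters or using different prompts.
# The simple example below assumes the size of the generated image can be adjusted.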
from diffusers import AutoPipelineForText2Image
import torch
device = "cuda" if torch.cuda.is_available() else "cpu"
pipeline = AutoPipelineForText2Image.from_pretrained('black-forest-labs/FLUX.1-dev', torch_dtype=torch.bfloat16).to(device)
pipeline.load_lora_weights('Keltezaa/elizabeth-olsen-sdxl-flux', weight_name='eliolsen_2017_local_164_merger_20v1_8v2_34v2_03_03_04.safetensors')
prompt = 'High quality passport photo of a woman with wavy blonde hair wearing a suit and tie looking directly at the camera with her mouth closed and a neutral expression. She is also wearing a delicate gold chain and some understated diamond earrings.'
image = pipeline(prompt, width=800, height=800).images[0]
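Beyond image size, a few other call parameters are commonly adjusted. The sketch below is illustrative only: it assumes the loaded pipeline accepts `num_inference_steps`, `guidance_scale`, and `generator` keyword arguments and that it prefers dimensions divisible by 16. The helper names and the default values (28 steps, guidance 3.5) are assumptions used as starting points, not settings taken from this model card.

```python
def snap_to_multiple(value: int, multiple: int = 16) -> int:
    """Round a dimension down to the nearest multiple, with the multiple as a floor."""
    return max(multiple, (int(value) // multiple) * multiple)


def build_call_kwargs(width: int, height: int, steps: int = 28, guidance: float = 3.5) -> dict:
    """Assemble keyword arguments for the pipeline call (hypothetical helper)."""
    return {
        "width": snap_to_multiple(width),
        "height": snap_to_multiple(height),
        "num_inference_steps": steps,
        "guidance_scale": guidance,
    }


def generate(pipeline, prompt: str, width: int = 800, height: int = 800, seed: int = 0):
    """Run one generation with a fixed seed so the result can be reproduced."""
    import torch  # imported lazily; only needed when actually generating

    generator = torch.Generator("cpu").manual_seed(seed)
    kwargs = build_call_kwargs(width, height)
    return pipeline(prompt, generator=generator, **kwargs).images[0]
```

With a pipeline loaded as in the snippet above, `generate(pipeline, prompt, seed=42).save("out.png")` would write one image; reusing the same seed and settings should reproduce it on the same hardware and library versions.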
📄 License
This model is released under the bespoke-lora-trained-license.
Model Information
Property | Details |
---|---|
Model type | Text-to-image generation model based on Stable Diffusion and LoRA |
Base model | black-forest-labs/FLUX.1-dev |
Tags | text-to-image, stable-diffusion, lora, diffusers, template:sd-lora, migrated, photorealistic, marvel, woman, celebrity |
Example Outputs
Prompt | Output image |
---|---|
High quality passport photo of a woman wearing a suit and tie looking directly at the camera with her mouth closed and a neutral expression. She is also wearing a delicate gold chain and some understated diamond earrings. | 32784958.jpeg |
A closeup portrait photo of a woman with medium length hair who gazing directly at the camera with a neutral expression. She is wearing a multicolored knitted sweater against a dark background. Spotlight illumination. | 32702445.jpeg |
Instagram selfie of a woman during the ancient roman empire standing in the middle of a marketplace in ancient rome with people and merchants all around her. She has a toga on as would be befitting for a madam of the house and has a neutral expression. She has almost no makeup on. Her hair is in an intricate updo and held together by some golden hairpins. | 32702487.jpeg |
High res waist up portrait photo of a woman with her hair in platinum blonde box-braids with glittery eye-shadow and clear glossy lip-gloss. She is looking at the viewer with her mouth closed. She is wearing a thin string-like black choker and hoop earrings. In the background is a nightclub scene out of focus. | 32704240.jpeg |
Instagram photo of a woman sitting at a table wearing a light grey sweatshirt with green polka-dots on it, holding a sign that has the writing "Welcome to FLUX!!" written on it. She is looking at the camera with a slight smile and in the background are some office plants on either side of her out of focus. | 32702761.jpeg |
High quality passport photo of a woman with long wavy golden-blonde hair wearing a suit and tie looking directly at the camera with her mouth closed and a neutral expression. She is also wearing a delicate gold chain. | 32705182.jpeg |
High quality photo in side view of a woman looking at the viewer standing on top of a hill in the windy scottish highlands wearing a tight full sleeve figure hugging turtle neck shirt and white dress pants that are also form fitted. | 32702970.jpeg |
Instagram photo of a blonde woman with freckles on her face in the winter outdoors silhouette illumination of her hair. Taken with a ProPhoto iPhone camera. | 32703037.jpeg |
Contact sheet with 4 images of a woman with a messy short bob cut wearing a black turtleneck. | 32702875.jpeg |
High quality passport photo of a woman with wavy blonde hair wearing a suit and tie looking directly at the camera with her mouth closed and a neutral expression. She is also wearing a delicate gold chain and some understated diamond earrings. | 32785711.jpeg |
(Elizabeth Olsen:1.2)A captivating portrait of Marvel Comics' Scarlet Witch, Wanda Maximoff, with an ethereal quality that reflects her chaotic and powerful abilities. She stands in the center, her crimson hair cascading down in soft waves that frame her delicate yet intense expression. Her eyes are a piercing shade of emerald, hinting at the untamed mystical forces within her. She wears her iconic costume, consisting of a form-fitting red dress with a plunging neckline and gold accents that trace the lines of her body, emphasizing her voluptuous figure. The dress transitions into a flowing skirt that flares out slightly at her hips, giving her an aura of elegance and danger. The crimson hue of the dress matches her skin-tight leather boots that reach up to her thighs, adorned with golden laces and buckles. On her left hand, she wears the crimson gauntlet that channels her power, while her right hand is raised, her fingers curling as if casting a spell. Swirling around her wrist and forearm are intricate patterns of glowing red energy, a visual representation of her chaos magic. The background is a blend of dark purples and smoky blues, with flickering lights and spectral shapes hinting at the turbulent dimension of the Hex. Her left eye is slightly obscured by a lock of hair that's been pushed aside by an invisible force, revealing a sliver of her true power. The overall composition is dynamic and enigmatic, with a focus on the interplay of light and shadow that enhances the mystical and haunting atmosphere of the piece. | images/example_w6n0petr4.png |
(Elizabeth Olsen:1.2)A captivating passport photo of Marvel Comics' Scarlet Witch, Wanda Maximoff, with an ethereal quality that reflects her chaotic and powerful abilities. her crimson hair cascading down in soft waves with intricate patterns of glowing red energy that frame her delicate yet intense expression. She wears her iconic costume, consisting of a form-fitting red dress with a plunging neckline and gold accents that trace the lines of her body, emphasizing her voluptuous figure. On her left hand, she wears the crimson gauntlet that channels her power, while her right hand is raised, her fingers curling as if casting a spell. Swirling around her wrist and forearm are intricate patterns of glowing red energy, a visual representation of her chaos magic. The background is a blend of dark purples and smoky blues, with flickering lights and spectral shapes hinting at the turbulent dimension of the Hex. Her left eye is slightly obscured by a lock of hair that's been pushed aside by an invisible force, revealing a sliver of her true power. The overall composition is dynamic and enigmatic, with a focus on the interplay of light and shadow that enhances the mystical and haunting atmosphere of the piece. | images/example_346c9qxz5.png |
(Elizabeth Olsen:1.2)A captivating passport photo of Marvel Comics' Scarlet Witch, Wanda Maximoff, with an ethereal quality that reflects her chaotic and powerful abilities. her crimson hair cascading down in soft waves with intricate patterns of glowing red flaming energy that frame her delicate yet intense expression. She wears her iconic costume, consisting of a her iconic gold headband with a ruby red gem as a center piece and form-fitting red dress with a plunging neckline and gold accents that trace the lines of her body, emphasizing her voluptuous figure. On her left hand, she wears the crimson gauntlet that channels her power, while her right hand is raised, her fingers curling as if casting a spell. Swirling around her wrist and forearm are intricate patterns of glowing red energy, a visual representation of her chaos magic. The background is a blend of dark purples and smoky blues, with flickering lights and spectral shapes hinting at the turbulent dimension of the Hex. Her left eye is slightly obscured by a lock of hair that's been pushed aside by an invisible force, revealing a sliver of her true power. The overall composition is dynamic and enigmatic, with a focus on the interplay of light and shadow that enhances the mystical and haunting atmosphere of the piece. | images/example_emdmjecis.png |