ProteusV0.2開源文生圖模型 - 強化核心功能，提示詞理解與風格表現更出色

首頁

Proteusv0.2

由dataautogpt3開發

ProteusV0.2是基於OpenDalleV1.1的進階版本，通過核心功能強化實現卓越的文生圖效果，特別在提示詞理解和風格表現上有顯著提升。

圖像生成開源協議:Gpl-3.0 #超現實動漫混合 #高精度面部渲染 #DPO優化畫質

下載量 20.62k

發布時間 : 1/19/2024

模型概述

ProteusV0.2是一個文生圖模型，專注於生成高質量圖像，支持超現實主義、動漫及卡通等多種視覺風格。通過混合RealCartoonXL和直接偏好優化（DPO）技術，提升了模型的提示響應能力和創造力。

模型特點

靈敏的提示響應

模型對提示詞的理解能力顯著提升，能夠更準確地生成符合描述的圖像。

卓越的風格表現

支持超現實主義、動漫、卡通等多種視覺風格，風格表現力接近MJ6。

複雜面部特徵優化

通過動態加載LORA模型技術，顯著提升了面部特徵和真實膚質的表現。

直接偏好優化（DPO）

使用精選的AI生成圖像對進行優化，提升了模型的整體性能。

模型能力

文本到圖像生成

多風格圖像生成

高質量細節渲染

複雜場景生成

使用案例

藝術創作

動漫角色設計

生成具有特定風格的動漫角色圖像，如黑色蓬鬆的貓科動物或像素藝術風格的太空少女。

高質量、細節豐富的動漫角色圖像。

肖像畫生成

生成具有特定風格和情感的肖像畫，如白鬚長髯的老者或深夜酒吧獨坐的銀行從業者。

情感豐富、風格獨特的肖像畫。

影視概念設計

電影風格劇照

生成具有電影膠片質感的劇照，如日本地鐵上的和服女子。

具有柯達電影膠片質感的劇照，淺景深和暗角暈影效果。

科幻場景設計

生成太空廢土風格的場景，如身著橄欖綠舊棉袍的聖女貞德。

充滿科幻氛圍的髒汙噪點效果，極致細節。

🚀 ProteusV0.2

ProteusV0.2是一款強大的文本到圖像生成模型，它在OpenDalleV1.1的基礎上進行了顯著改進，能更好地理解提示詞，在多種風格的圖像生成上表現出色，尤其在超現實主義、動漫和卡通風格方面。

🚀 快速開始

模型特點

ProteusV0.2與RealCartoonXL合併，僅以0.5%的權重，就解決了無法理解與動漫或卡通風格相關標籤的問題。與版本0.1相比，版本0.2有了細微但顯著的改進，在提示理解上超越了MJ6，同時也接近其風格表現能力。

模型優勢

Proteus是對OpenDalleV1.1的複雜增強，利用其核心功能來提供更優的結果。主要改進領域包括對提示的更高響應性和增強的創作能力。為實現這一點，它使用了約220,000張來自免版權庫存圖像（包含一些動漫）的GPTV字幕圖像進行微調，然後進行了歸一化處理。此外，還通過精心挑選的10,000對高質量AI生成圖像對，採用了直接偏好優化（DPO）方法。

模型效果

為追求最佳性能，大量的低秩適應（LORA）模型被獨立訓練，然後通過動態應用方法有選擇地整合到主模型中。這些技術在學習階段針對模型的特定部分，同時避免干擾其他區域。因此，Proteus在描繪複雜的面部特徵和逼真的皮膚紋理方面有顯著改進，同時在各種美學領域，特別是超現實主義、動漫和卡通風格可視化方面保持了出色的能力。

✨ 主要特性

風格融合：與RealCartoonXL合併，解決動漫和卡通風格標籤理解問題。
提示理解增強：超越MJ6的提示理解能力。
多風格適配：在超現實主義、動漫和卡通風格等多種美學領域表現出色。
細節優化：在描繪複雜面部特徵和皮膚紋理方面有顯著改進。

📦 安裝指南

使用以下設置以獲得ProteusV0.2的最佳效果：

CFG Scale：使用8到7的CFG比例。
Steps：20到60步以獲得更多細節，20步以獲得更快結果。
Sampler：DPM++ 2M SDE
Scheduler：Karras
Resolution：1280x1280或1024x1024

此外，建議在提示詞中使用以下關鍵詞來提升效果：最佳質量、高清、~*~美學~*~。

如果在構思提示詞方面遇到困難，可以使用這個我整理的GPT來幫助優化提示詞：點擊進入

💻 使用示例

基礎用法

import torch
from diffusers import (
    StableDiffusionXLPipeline, 
    KDPM2AncestralDiscreteScheduler,
    AutoencoderKL
)

# Load VAE component
vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix", 
    torch_dtype=torch.float16
)

# Configure the pipeline
pipe = StableDiffusionXLPipeline.from_pretrained(
    "dataautogpt3/ProteusV0.2", 
    vae=vae,
    torch_dtype=torch.float16
)
pipe.scheduler = KDPM2AncestralDiscreteScheduler.from_config(pipe.scheduler.config)
pipe.to('cuda')

# Define prompts and generate image
prompt = "black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed"
negative_prompt = "nsfw, bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"

image = pipe(
    prompt, 
    negative_prompt=negative_prompt, 
    width=1024,
    height=1024,
    guidance_scale=7.5,
    num_inference_steps=50
).images[0]

📄 許可證

本項目採用GPL-3.0許可證。

支持作者

如果您覺得這個項目有幫助，請通過以下方式支持作者：

示例展示

| 輸入提示詞 | 輸出圖片鏈接 | | ---- | ---- | | black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed | [ComfyUI_03087_.png](ComfyUI_03087_.png) | | (impressionistic realism by csybgh), a 50 something male, working in banking, very short dyed dark curly balding hair, Afro-Asiatic ancestry, talks a lot but listens poorly, stuck in the past, wearing a suit, he has a certain charm, bronze skintone, sitting in a bar at night, he is smoking and feeling cool, drunk on plum wine, masterpiece, 8k, hyper detailed, smokey ambiance, perfect hands AND fingers | [GEN8-iTXcAA-okN.jpeg](GEN8-iTXcAA-okN.jpeg) | | high quality pixel art, a pixel art silhouette of an anime space-themed girl in a space-punk steampunk style, lying in her bed by the window of a spaceship, smoking, with a rustic feel. The image should embody epic portraiture and double exposure, featuring an isolated landscape visible through the window. The colors should primarily be dynamic and action-packed, with a strong use of negative space. The entire artwork should be in pixel art style, emphasizing the characters shape and set against a white background. Silhouette | [ComfyUI_03060_.png](ComfyUI_03060_.png) | | The image features an older man, a long white beard and mustache, He has a stern expression, giving the impression of a wise and experienced individual. The mans beard and mustache are prominent, adding to his distinguished appearance. The close-up shot of the mans face emphasizes his facial features and the intensity of his gaze. | [ComfyUI_03017_.png](ComfyUI_03017_.png) | | Super Closeup Portrait, action shot, Profoundly dark whitish meadow, glass flowers, Stains, space grunge style, Jeanne d'Arc wearing White Olive green used styled Cotton frock, Wielding thin silver sword, Sci-fi vibe, dirty, noisy, Vintage monk style, very detailed, hd | [ComfyUI_03045.png](ComfyUI_03045.png) | | cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy | [3.png](3.png) | | in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro | [ComfyUI_03061_.png](ComfyUI_03061_.png) | | 1980s anime portrait of a character glitching. His face is separated from his body by heavy static. His face is deformed by pain. Dream-like, analog horror, glitch, terrifying | [ComfyUI_03092_.png](ComfyUI_03092_.png) | | (("Proteus"):text_logo:1) | [ComfyUI_03297_.png](ComfyUI_03297_.png) | | dan seagrave, dante, Abandon All Hope, Ye Who Enter Here, hell religious art purgatory zdzislaw Beksinski, abyss inferno, lost, wanderer | [ComfyUI_03483_.png](ComfyUI_03483_.png) |