🚀 SegMoE-4x2-v0:Segmind擴散專家混合模型
SegMoE-4x2-v0是一個未經訓練的Segmind擴散專家混合模型,它使用segmoe框架,基於4個專家SDXL模型生成。SegMoE是一個強大的框架,可在幾分鐘內將多個穩定擴散模型動態組合成專家混合模型,無需進行訓練。該框架允許即時創建更大的模型,這些模型具有更廣泛的知識、更好的貼合度和更高的圖像質量。

🚀 快速開始
本模型可通過segmoe庫使用。
📦 安裝指南
請確保通過以下命令安裝segmoe
:
pip install segmoe
💻 使用示例
基礎用法
from segmoe import SegMoEPipeline
pipeline = SegMoEPipeline("segmind/SegMoE-4x2-v0", device = "cuda")
prompt = "cosmic canvas, orange city background, painting of a chubby cat"
negative_prompt = "nsfw, bad quality, worse quality"
img = pipeline(
prompt=prompt,
negative_prompt=negative_prompt,
height=1024,
width=1024,
num_inference_steps=25,
guidance_scale=7.5,
).images[0]
img.save("image.png")

🔧 技術細節
Config
用於創建此模型的配置如下:
base_model: SG161222/RealVisXL_V3.0
num_experts: 4
moe_layers: all
num_experts_per_tok: 2
experts:
- source_model: frankjoshua/juggernautXL_v8Rundiffusion
positive_prompt: "aesthetic, cinematic, hands, portrait, photo, illustration, 8K, hyperdetailed, origami, man, woman, supercar"
negative_prompt: "(worst quality, low quality, normal quality, lowres, low details, oversaturated, undersaturated, overexposed, underexposed, grayscale, bw, bad photo, bad photography, bad art:1.4), (watermark, signature, text font, username, error, logo, words, letters, digits, autograph, trademark, name:1.2), (blur, blurry, grainy), morbid, ugly, asymmetrical, mutated malformed, mutilated, poorly lit, bad shadow, draft, cropped, out of frame, cut off, censored, jpeg artifacts, out of focus, glitch, duplicate, (airbrushed, cartoon, anime, semi-realistic, cgi, render, blender, digital art, manga, amateur:1.3), (3D ,3D Game, 3D Game Scene, 3D Character:1.1), (bad hands, bad anatomy, bad body, bad face, bad teeth, bad arms, bad legs, deformities:1.3)"
- source_model: SG161222/RealVisXL_V3.0
positive_prompt: "cinematic, portrait, photograph, instagram, fashion, movie, macro shot, 8K, RAW, hyperrealistic, ultra realistic,"
negative_prompt: "(octane render, render, drawing, anime, bad photo, bad photography:1.3), (worst quality, low quality, blurry:1.2), (bad teeth, deformed teeth, deformed lips), (bad anatomy, bad proportions:1.1), (deformed iris, deformed pupils), (deformed eyes, bad eyes), (deformed face, ugly face, bad face), (deformed hands, bad hands, fused fingers), morbid, mutilated, mutation, disfigured"
- source_model: albertushka/albertushka_DynaVisionXL
positive_prompt: "minimalist, illustration, award winning art, painting, impressionist, comic, colors, sketch, pencil drawing,"
negative_prompt: "Compression artifacts, bad art, worst quality, low quality, plastic, fake, bad limbs, conjoined, featureless, bad features, incorrect objects, watermark, ((signature):1.25), logo"
- source_model: frankjoshua/albedobaseXL_v13
positive_prompt: "photograph f/1.4, ISO 200, 1/160s, 8K, RAW, unedited, symmetrical balance, in-frame, 8K"
negative_prompt: "nsfw, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, blurry"
其他變體
我們在Hugging Face上發佈了3個合併模型:
對比
從以下圖像可以看出,提示理解能力有所提高。從左到右依次為SegMoE-2x1-v0、SegMoE-4x2-v0、基礎模型(RealVisXL_V3.0)

三個綠色玻璃瓶

頭上戴著飛行員眼鏡的熊貓

自由女神像在華盛頓紀念碑旁邊
模型描述
適用範圍外使用
SegMoE-4x2-v0模型不適用於創建人物、事件或現實世界信息的事實性或準確表示。它不用於需要高精度和準確性的任務。
✨ 主要特性
- 多專家知識融合:受益於多個微調專家的知識。
- 無需訓練:無需進行訓練,即可快速組合模型。
- 更好的數據適應性:能更好地適應數據。
- 模型可升級:可以通過使用更好的微調模型作為專家之一來升級模型。
侷限性
- 依賴專家知識:雖然該模型在圖像保真度和貼合度上有所改進,但在未訓練的情況下,它不會比任何一個專家模型有顯著提升,並且依賴於專家的知識。
- 速度未優化:目前尚未針對速度進行優化。
- 內存使用未優化:該框架尚未針對內存使用進行優化。
📄 許可證
本模型採用Apache 2.0許可證。
引用
@misc{segmoe,
author = {Yatharth Gupta, Vishnu V Jaddipal, Harish Prabhala},
title = {SegMoE},
year = {2024},
publisher = {HuggingFace},
journal = {HuggingFace Models},
howpublished = {\url{https://huggingface.co/segmind/SegMoE-4x2-v0}}
}