Stable Diffusion 3.5 Medium开源文生图模型 - 支持多风格免费生成图像

首页

Sd35m Sfwbooru Lycoris

由 bghira 开发

Stable Diffusion 3.5 Medium 是一个基于扩散模型的文生图/图生图模型，支持多种风格的图像生成，包括幻想、科幻、赛博朋克等。

图像生成开源协议:其他 #多风格文生图 #高细节渲染 #赛博朋克创作

下载量 595

发布时间 : 3/25/2025

模型简介

该模型是一个基于扩散模型的图像生成模型，能够根据文本提示生成高质量的图像，支持多种风格和应用场景。

模型特点

高质量图像生成

能够生成高分辨率、高细节的图像，适用于多种风格和场景。

多风格支持

支持幻想、科幻、赛博朋克、中世纪等多种风格。

文生图与图生图

支持根据文本提示生成图像，也支持基于现有图像进行修改和增强。

LoRA和LyCORIS支持

支持LoRA和LyCORIS等轻量级微调技术，便于模型定制和优化。

模型能力

文本到图像生成

图像到图像生成

高分辨率图像生成

多风格图像生成

支持LoRA微调

支持LyCORIS微调

使用案例

艺术创作

幻想艺术

生成幻想风格的图像，如魔法森林、巨龙等。

高细节、高分辨率的幻想艺术图像。

科幻场景

生成科幻风格的图像，如未来城市、太空战斗等。

具有未来感的科幻场景图像。

游戏设计

角色设计

生成游戏角色概念图，如赛博格、精灵等。

多样化的角色设计图像。

场景设计

生成游戏场景概念图，如中世纪市场、废弃游乐场等。

丰富的场景设计图像。

广告与营销

广告素材

生成广告所需的图像素材，如霓虹灯招牌、复古餐厅等。

吸引眼球的广告图像。

产品展示

生成产品展示图像，如复古车辆、古董店等。

高质量的产品展示图像。

🚀 sd35m-sfwbooru-lycoris

这是一个基于 stabilityai/stable-diffusion-3.5-medium 的 LyCORIS 适配器。它能够在图像生成任务中，基于基础模型生成更符合特定需求的图像，为图像生成领域带来更多可能性。

🚀 快速开始

推理示例

以下是使用该适配器进行推理的 Python 代码示例：

import torch
from diffusers import DiffusionPipeline
from lycoris import create_lycoris_from_weights


def download_adapter(repo_id: str):
    import os
    from huggingface_hub import hf_hub_download
    adapter_filename = "pytorch_lora_weights.safetensors"
    cache_dir = os.environ.get('HF_PATH', os.path.expanduser('~/.cache/huggingface/hub/models'))
    cleaned_adapter_path = repo_id.replace("/", "_").replace("\\", "_").replace(":", "_")
    path_to_adapter = os.path.join(cache_dir, cleaned_adapter_path)
    path_to_adapter_file = os.path.join(path_to_adapter, adapter_filename)
    os.makedirs(path_to_adapter, exist_ok=True)
    hf_hub_download(
        repo_id=repo_id, filename=adapter_filename, local_dir=path_to_adapter
    )

    return path_to_adapter_file
    
model_id = 'stabilityai/stable-diffusion-3.5-medium'
adapter_repo_id = 'bghira/sd35m-sfwbooru-lycoris'
adapter_filename = 'pytorch_lora_weights.safetensors'
adapter_file_path = download_adapter(repo_id=adapter_repo_id)
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
lora_scale = 1.0
wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_file_path, pipeline.transformer)
wrapper.merge_to()

prompt = "A photo-realistic image of a cat"
negative_prompt = 'blurry, cropped, ugly'

## Optional: quantise the model to save on vram.
## Note: The model was not quantised during training, so it is not necessary to quantise it during inference time.
#from optimum.quanto import quantize, freeze, qint8
#quantize(pipeline.transformer, weights=qint8)
#freeze(pipeline.transformer)
    
pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_inference_steps=30,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=1024,
    height=1024,
    guidance_scale=3.2,
    skip_guidance_layers=[7, 8, 9],
).images[0]

model_output.save("output.png", format="PNG")

验证设置

CFG: 3.2
CFG 缩放: 0.0
步数: 30
采样器: FlowMatchEulerDiscreteScheduler
种子: 42
分辨率: 1024x1024
跳过层引导: skip_guidance_layers=[7, 8, 9]

注意：验证设置不一定与训练设置相同。

训练设置

属性	详情
训练轮数	3
训练步数	220250
学习率	5e-06
学习率调度	余弦
热身步数	500000
最大梯度值	0.01
有效批量大小	6
微批量大小	6
梯度累积步数	1
GPU 数量	1
梯度检查点	启用
预测类型	流匹配 (额外参数=['shift=3'])
优化器	optimi-lion
可训练参数精度	纯 BF16
基础模型精度	`no_change`
字幕丢弃概率	10.0%

LyCORIS 配置

{
    "algo": "lokr",
    "multiplier": 1.0,
    "full_matrix": true,
    "linear_alpha": 1,
    "factor": 16,
    "apply_preset": {
        "target_module": [
            "Attention"
        ],
        "module_algo_map": {
            "Attention": {
                "factor": 6
            }
        }
    }
}