ltxvideo-disney開源模型 - 免費部署生成黑白迪士尼風格視頻內容

首頁

Ltxvideo Disney

由bghira開發

基於Lightricks/LTX-Video訓練的LyCORIS適配器，專注於生成黑白迪士尼風格的視頻內容。

文本生成視頻開源協議:其他 #黑白迪士尼風格 #LyCORIS適配器 #流匹配視頻生成

下載量 18

發布時間 : 3/21/2025

模型概述

該模型是一個文本生成視頻的適配器，特別擅長生成《蒸汽船威利》風格的黑白迪士尼場景。

模型特點

黑白迪士尼風格

特別擅長生成《蒸汽船威利》風格的黑白迪士尼場景。

LyCORIS適配器

基於Lightricks/LTX-Video訓練的LyCORIS適配器，提供更高效的微調能力。

流匹配預測

使用流匹配預測類型，優化視頻生成質量。

模型能力

文本生成視頻

圖像生成視頻

視頻生成視頻

使用案例

創意內容生成

黑白動畫風格視頻創作

生成具有復古黑白迪士尼風格的動畫視頻內容。

示例展示了《蒸汽船威利》風格的黑白迪士尼場景。

動漫風格動作場景

生成動漫角色在都市環境中進行動作表演的視頻。

示例展示了午夜霓虹都市中動漫主角的流暢動作。

🚀 ltxvideo-disney

這是一個基於Lightricks/LTX-Video的LyCORIS適配器。它能夠生成具有特定風格的黑白迪士尼場景視頻，為視頻生成領域帶來了新的可能性。

🚀 快速開始

你可以參考以下代碼示例，快速開始使用 ltxvideo-disney 進行推理：

import torch
from diffusers import DiffusionPipeline
from lycoris import create_lycoris_from_weights


def download_adapter(repo_id: str):
    import os
    from huggingface_hub import hf_hub_download
    adapter_filename = "pytorch_lora_weights.safetensors"
    cache_dir = os.environ.get('HF_PATH', os.path.expanduser('~/.cache/huggingface/hub/models'))
    cleaned_adapter_path = repo_id.replace("/", "_").replace("\\", "_").replace(":", "_")
    path_to_adapter = os.path.join(cache_dir, cleaned_adapter_path)
    path_to_adapter_file = os.path.join(path_to_adapter, adapter_filename)
    os.makedirs(path_to_adapter, exist_ok=True)
    hf_hub_download(
        repo_id=repo_id, filename=adapter_filename, local_dir=path_to_adapter
    )

    return path_to_adapter_file
    
model_id = 'Lightricks/LTX-Video'
adapter_repo_id = 'bghira/ltxvideo-disney'
adapter_filename = 'pytorch_lora_weights.safetensors'
adapter_file_path = download_adapter(repo_id=adapter_repo_id)
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
lora_scale = 1.0
wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_file_path, pipeline.transformer)
wrapper.merge_to()

prompt = "A black and white disney scene in the style of Steamboat Willie"
negative_prompt = 'ugly, cropped, blurry, low-quality, mediocre average'

## Optional: quantise the model to save on vram.
## Note: The model was quantised during training, and so it is recommended to do the same during inference time.
from optimum.quanto import quantize, freeze, qint8
quantize(pipeline.transformer, weights=qint8)
freeze(pipeline.transformer)
    
pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_inference_steps=25,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=768,
    height=512,
    guidance_scale=3.8,
).frames[0]

from diffusers.utils.export_utils import export_to_gif
export_to_gif(model_output, "output.gif", fps=25)

✨ 主要特性

風格獨特：能夠生成具有黑白迪士尼風格的視頻，風格類似《汽船威利號》。
可定製性強：提供了豐富的訓練和推理參數設置，用戶可以根據需求進行調整。
推理便捷：支持直接使用基礎模型的文本編碼器進行推理，無需額外訓練。

📚 詳細文檔

驗證設置

分類器自由引導（CFG）：3.8
CFG 重縮放：0.0
步數：25
採樣器：FlowMatchEulerDiscreteScheduler
隨機種子：42
分辨率：768x512

注意：驗證設置不一定與訓練設置相同。

你可以在以下圖庫中找到一些示例圖像：

文本編碼器未進行訓練，你可以重用基礎模型的文本編碼器進行推理。

訓練設置

訓練輪數：2666
訓練步數：8000
學習率：5e-05
- 學習率調度：餘弦
- 熱身步數：400000
最大梯度值：0.0
有效批量大小：24
- 微批量大小：8
- 梯度累積步數：1
- GPU 數量：3
梯度檢查點：啟用
預測類型：流匹配（額外參數=['training_scheduler_timestep_spacing=trailing', 'inference_scheduler_timestep_spacing=trailing']）
優化器：adamw_bf16
可訓練參數精度：純 BF16
基礎模型精度：int8-quanto
字幕丟棄概率：10.0%

LyCORIS 配置：

{
    "bypass_mode": true,
    "algo": "lokr",
    "multiplier": 1.0,
    "full_matrix": true,
    "linear_dim": 10000,
    "linear_alpha": 1,
    "factor": 4,
    "apply_preset": {
        "target_module": [
            "Attention",
            "FeedForward"
        ],
        "module_algo_map": {
            "FeedForward": {
                "factor": 4
            },
            "Attention": {
                "factor": 2
            }
        }
    }
}