RCNA_MINI開源模型 - 免費部署，四步實現高質量文本到視頻轉換

首頁

RCNA MINI

由Binarybardakshat開發

RCNA MINI 是一款緊湊型 LoRA 模型，專為生成高質量的四步文本轉視頻輸出而設計。

文本生成視頻支持多種語言開源協議:Apache-2.0 #四步文生視頻 #LoRA高效微調 #8K高分辨率

下載量 18

發布時間 : 9/29/2024

模型概述

RCNA MINI 是一款基於 LoRA 架構的文本生成視頻模型，能夠快速生成高質量、細節豐富的短視頻片段，適用於創意內容和社交媒體。

模型特點

四步文本轉視頻

僅需4步即可根據文本提示生成視頻，生成速度快。

高質量輸出

支持高達8K的高分辨率和細節呈現，生成視頻質量高。

快速採樣

通過解耦一致性學習技術優化生成速度，同時保證質量。

緊湊型設計

基於LoRA架構，計算開銷小，適合快速部署。

模型能力

文本生成視頻

高分辨率視頻生成

快速視頻生成

使用案例

社交媒體

短視頻動畫

為社交媒體平臺生成吸引人的短視頻動畫內容。

生成4至16秒的高質量視頻片段。

創意項目

藝術視頻創作

基於文本描述生成藝術視頻，用於創意項目或視覺藝術。

細節豐富、過渡流暢的短動畫。

🚀 RCNA MINI

RCNA MINI 是一款緊湊的 LoRA（低秩自適應）模型，專為生成高質量的 4 步文本到視頻輸出而設計。它可以創建時長從 4 秒到 16 秒的視頻片段，非常適合生成具有豐富細節和流暢過渡的短動畫。

🚀 快速開始

RCNA MINI 是基於 LoRA 架構的模型，能夠根據文本描述快速生成視頻。以下是使用它的示例代碼：

import torch
from diffusers import AnimateDiffPipeline, LCMScheduler, MotionAdapter, DiffusionPipeline
from diffusers.utils import export_to_gif

# Load AnimateLCM for video generation
adapter = MotionAdapter.from_pretrained("Binarybardakshat/RCNA_MINI")
pipe = AnimateDiffPipeline.from_pretrained("emilianJR/epiCRealism", motion_adapter=adapter, torch_dtype=torch.float16)
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config, beta_schedule="linear")
pipe.load_lora_weights("Binarybardakshat/RCNA_MINI", weight_name="RCNA_LORA_MINI_1.safetensors", adapter_name="lcm-lora")
pipe.set_adapters(["lcm-lora"], [0.8])
pipe.enable_vae_slicing()
pipe.enable_model_cpu_offload()

# Generate video using RCNA MINI
output = pipe(
    prompt="A space rocket with trails of smoke behind it launching into space from the desert, 4k, high resolution",
    negative_prompt="bad quality, worse quality, low resolution",
    num_frames=16,
    guidance_scale=2.0,
    num_inference_steps=6,
    generator=torch.Generator("cpu").manual_seed(0),
)
frames = output.frames[0]
export_to_gif(frames, "animatelcm.gif")
print("Video and image generation complete!")

✨ 主要特性

4 步文本到視頻：僅需 4 步即可根據文本提示生成視頻。
視頻長度：可生成 4 秒至 16 秒長的視頻。
高質量：支持高分辨率和詳細的輸出（最高可達 8K）。
快速採樣：利用解耦一致性學習，該模型在保證質量的同時優化了速度。

💻 使用示例

基礎用法

# 上述快速開始中的代碼即為基礎用法示例
import torch
from diffusers import AnimateDiffPipeline, LCMScheduler, MotionAdapter, DiffusionPipeline
from diffusers.utils import export_to_gif

# Load AnimateLCM for video generation
adapter = MotionAdapter.from_pretrained("Binarybardakshat/RCNA_MINI")
pipe = AnimateDiffPipeline.from_pretrained("emilianJR/epiCRealism", motion_adapter=adapter, torch_dtype=torch.float16)
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config, beta_schedule="linear")
pipe.load_lora_weights("Binarybardakshat/RCNA_MINI", weight_name="RCNA_LORA_MINI_1.safetensors", adapter_name="lcm-lora")
pipe.set_adapters(["lcm-lora"], [0.8])
pipe.enable_vae_slicing()
pipe.enable_model_cpu_offload()

# Generate video using RCNA MINI
output = pipe(
    prompt="A space rocket with trails of smoke behind it launching into space from the desert, 4k, high resolution",
    negative_prompt="bad quality, worse quality, low resolution",
    num_frames=16,
    guidance_scale=2.0,
    num_inference_steps=6,
    generator=torch.Generator("cpu").manual_seed(0),
)
frames = output.frames[0]
export_to_gif(frames, "animatelcm.gif")
print("Video and image generation complete!")

高級用法

# 可以根據不同的需求調整參數，例如修改提示詞、幀數、引導比例等
import torch
from diffusers import AnimateDiffPipeline, LCMScheduler, MotionAdapter, DiffusionPipeline
from diffusers.utils import export_to_gif

# Load AnimateLCM for video generation
adapter = MotionAdapter.from_pretrained("Binarybardakshat/RCNA_MINI")
pipe = AnimateDiffPipeline.from_pretrained("emilianJR/epiCRealism", motion_adapter=adapter, torch_dtype=torch.float16)
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config, beta_schedule="linear")
pipe.load_lora_weights("Binarybardakshat/RCNA_MINI", weight_name="RCNA_LORA_MINI_1.safetensors", adapter_name="lcm-lora")
pipe.set_adapters(["lcm-lora"], [0.8])
pipe.enable_vae_slicing()
pipe.enable_model_cpu_offload()

# 調整提示詞和幀數
output = pipe(
    prompt="A beautiful forest with colorful flowers, 8k, high resolution",
    negative_prompt="bad quality, worse quality, low resolution",
    num_frames=20,
    guidance_scale=2.5,
    num_inference_steps=8,
    generator=torch.Generator("cpu").manual_seed(1),
)
frames = output.frames[0]
export_to_gif(frames, "advanced_animatelcm.gif")
print("Advanced video and image generation complete!")