redshift-man-skiing open-source video generation model - free to deploy, generates Redshift-style sports videos from text

Redshift Man Skiing

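Developed by Tune-A-Video-library

A video generation model fine-tuned from nitrosocke/redshift-diffusion that generates Redshift-style sports videos from text prompts.

Video processing

License: OpenRAIL

#One-shot video generation #Redshift art style #Sports scene transfer

Downloads: 17

Released: 2/7/2023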

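Model Overview

The model applies the Tune-A-Video method to tune a base diffusion model on a single sample, enabling text-to-video generation. It is particularly suited to sports scenes in the Redshift style.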

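Model Features

Redshift-style video generation

Generates dynamic video content with a distinctive Redshift art style.

One-shot tuning

Adapts the base model with only a single training sample to generate videos of a specific scene.

Character replacement

Replaces the character in the video (for example, with Spider-Man or Batman) while preserving the original motion.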
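
In practice, the character swap described above comes down to changing the subject of the prompt while keeping the action fixed. The following is a minimal sketch, assuming the training prompt "a man is skiing" and the "(redshift style)" prefix used in the inference example on this page. The helper name `build_prompts` and the character list are hypothetical and not part of the Tune-A-Video repository.

```python
# Hypothetical helper: builds prompts that swap the subject but keep the
# action from the training prompt ("a man is skiing").
STYLE_PREFIX = "(redshift style)"  # style token used in the inference example
ACTION = "is skiing"               # action kept from the training prompt


def build_prompts(characters):
    """Return one prompt per character, keeping style prefix and action fixed."""
    return [f"{STYLE_PREFIX} {name.strip()} {ACTION}" for name in characters]


prompts = build_prompts(["spider man", "batman"])
for p in prompts:
    print(p)
```

Each resulting string can be passed to the pipeline as the `prompt` argument, one call per character.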

Model Capabilities

Text-to-video generation

Stylized video synthesis

Character motion transfer

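Use Cases

Creative content generation

Superhero sports scenes

Generate videos of various superheroes skiing in the Redshift style.

Can produce 8-frame animated GIFs of characters such as Spider-Man or Batman skiing.

Artistic creation

Redshift-style art videos

Create short videos with a distinctive Redshift aesthetic.

Stylized video output at 512x512 resolution.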
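
To make the output size concrete, the sketch below computes the tensor shapes involved in an 8-frame, 512x512 generation. It assumes the usual Stable Diffusion VAE layout (4 latent channels, 8x spatial downsampling) and a (batch, channels, frames, height, width) ordering for video tensors. Both are assumptions about the pipeline internals rather than facts stated on this page, and the function name `video_shapes` is hypothetical.

```python
def video_shapes(video_length=8, height=512, width=512, batch=1,
                 latent_channels=4, vae_scale=8):
    """Return (latent_shape, pixel_shape) for one generation call.

    Assumes a Stable Diffusion style VAE: 4 latent channels and an 8x
    spatial downsampling factor. Layout is (batch, channels, frames, H, W).
    """
    if height % vae_scale or width % vae_scale:
        raise ValueError("height and width must be divisible by vae_scale")
    latent = (batch, latent_channels, video_length,
              height // vae_scale, width // vae_scale)
    pixels = (batch, 3, video_length, height, width)
    return latent, pixels


latent_shape, pixel_shape = video_shapes()
print(latent_shape)  # (1, 4, 8, 64, 64)
print(pixel_shape)   # (1, 3, 8, 512, 512)
```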

🚀 Tune-A-Video - Redshift

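Tune-A-Video - Redshift is a text-to-video model trained on top of the nitrosocke/redshift-diffusion base model. Tuned with the training prompt "a man is skiing", it generates Redshift-style videos such as a person skiing.

🚀 Quick Start

Clone the repository

First, clone the GitHub repository.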

git clone https://github.com/showlab/Tune-A-Video.git
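
The inference code imports from the `tuneavideo` package inside the cloned repository, so the script either has to run from the repository root or the clone has to be on Python's import path. Below is a minimal sketch of the second option, assuming the clone sits in a `Tune-A-Video` directory (the default name `git clone` produces) inside your working directory. The variable name `REPO_DIR` is illustrative.

```python
import sys
from pathlib import Path

# Assumed location of the clone: ./Tune-A-Video relative to the working directory
REPO_DIR = Path("Tune-A-Video").resolve()

# Put the repository root on the import path so `import tuneavideo` can resolve
if str(REPO_DIR) not in sys.path:
    sys.path.insert(0, str(REPO_DIR))

# After this, imports such as the following should resolve if the clone exists:
# from tuneavideo.util import save_videos_grid
```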

Run inference

import torch

from tuneavideo.models.unet import UNet3DConditionModel
from tuneavideo.pipelines.pipeline_tuneavideo import TuneAVideoPipeline
from tuneavideo.util import save_videos_grid

# Base model weights and the tuned UNet published for this model
pretrained_model_path = "nitrosocke/redshift-diffusion"
unet_model_path = "Tune-A-Video-library/redshift-man-skiing"

# Load the tuned UNet in half precision and move it to the GPU
unet = UNet3DConditionModel.from_pretrained(unet_model_path, subfolder="unet", torch_dtype=torch.float16).to("cuda")
# Build the pipeline from the base model, swapping in the tuned UNet
pipe = TuneAVideoPipeline.from_pretrained(pretrained_model_path, unet=unet, torch_dtype=torch.float16).to("cuda")
# Memory-efficient attention; requires the xformers package
pipe.enable_xformers_memory_efficient_attention()

# Generate an 8-frame clip at 512x512
prompt = "(redshift style) spider man is skiing"
video = pipe(prompt, video_length=8, height=512, width=512, num_inference_steps=50, guidance_scale=7.5).videos

# Save the result as a GIF named after the prompt
save_videos_grid(video, f"./{prompt}.gif")
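
The example above writes the GIF to a path built directly from the prompt, so the file name ends up containing spaces and parentheses. If that is inconvenient, a small helper can turn the prompt into a plainer name. This is only a sketch: `prompt_to_filename` is a hypothetical helper, not part of the repository.

```python
import re


def prompt_to_filename(prompt, ext="gif"):
    """Turn a free-text prompt into a lowercase, underscore-separated file name."""
    # Collapse every run of non-alphanumeric characters into one underscore
    slug = re.sub(r"[^A-Za-z0-9]+", "_", prompt).strip("_").lower()
    # Fall back to a generic name if the prompt had no usable characters
    return f"{slug or 'video'}.{ext}"


filename = prompt_to_filename("(redshift style) spider man is skiing")
print(filename)  # redshift_style_spider_man_is_skiing.gif
```

The returned name can be passed as the second argument of `save_videos_grid` in place of the prompt-based path.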

✨ Key Features

Trained on the nitrosocke/redshift-diffusion base model.
Tuned with the training prompt "a man is skiing", so it can generate videos in that specific style.
