cogvideox-disney-adamw-4000-0.0003-constant开源模型

首页

Cogvideox Disney Adamw 4000 0.0003 Constant

由 a-r-r-o-w 开发

基于CogVideoX-5b模型进行LoRA微调的版本，专注于迪士尼风格视频生成

文本生成视频英语#迪士尼风格视频生成 #LoRA高效微调 #动态文本转视频

下载量 16

发布时间 : 10/8/2024

模型简介

这是一个使用LoRA技术对CogVideoX-5b模型进行微调的版本，专门针对迪士尼风格视频生成任务进行了优化。

模型特点

LoRA微调技术

使用LoRA(Low-Rank Adaptation)技术进行高效微调，显著降低训练成本

迪士尼风格优化

针对迪士尼风格视频生成进行了专门优化

高效训练

采用TorchAO和DeepSpeed技术实现内存优化训练

模型能力

文本到视频生成

迪士尼风格视频创作

动态场景生成

使用案例

创意内容生成

迪士尼风格动画生成

根据文本描述生成迪士尼风格的动画场景

可生成包含角色互动、情感表达的动态视频

教育娱乐

教育动画制作

快速生成教育类动画内容

🚀 CogVideoX LoRA微调

这是一个针对文本到视频生成的项目，基于CogVideoX模型进行LoRA微调，能够利用特定数据集训练出具有特定风格的视频生成模型。

🚀 快速开始

本项目是 THUDM/CogVideoX - 5b 模型的LoRA微调版本。模型使用 CogVideoX Factory 进行训练，该仓库包含了使用 TorchAO 和 DeepSpeed 对CogVideoX系列模型进行内存优化的训练脚本。这些脚本改编自 CogVideoX Diffusers trainer。

✨ 主要特性

基于CogVideoX模型进行LoRA微调，可高效利用计算资源。
使用内存优化的训练脚本，适用于大规模数据集训练。
支持在Diffusers库中加载和使用LoRA权重。

📦 安装指南

使用此模型需要安装 🧨 Diffusers库。

💻 使用示例

基础用法

import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video

pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16).to("cuda")
pipe.load_lora_weights("a-r-r-o-w/cogvideox-disney-adamw-4000-0.0003-constant", weight_name="pytorch_lora_weights.safetensors", adapter_name="cogvideox-lora")

# The LoRA adapter weights are determined by what was used for training.
# In this case, we assume `--lora_alpha` is 32 and `--rank` is 64.
# It can be made lower or higher from what was used in training to decrease or amplify the effect
# of the LoRA upto a tolerance, beyond which one might notice no effect at all or overflows.
pipe.set_adapters(["cogvideox-lora"], [32 / 64])

video = pipe("BW_STYLE A black and white animated scene unfolds with an anthropomorphic goat surrounded by musical notes and symbols, suggesting a playful environment. Mickey Mouse appears, leaning forward in curiosity as the goat remains still. The goat then engages with Mickey, who bends down to converse or react. The dynamics shift as Mickey grabs the goat, potentially in surprise or playfulness, amidst a minimalistic background. The scene captures the evolving relationship between the two characters in a whimsical, animated setting, emphasizing their interactions and emotions", guidance_scale=6, use_dynamic_cfg=True).frames[0]
export_to_video(video, "output.mp4", fps=8)

更多详细信息，包括LoRA的加权、合并和融合，请查看 Diffusers中加载LoRA的文档。

📚 详细文档

下载模型

在 Files & Versions 标签中下载LoRA权重。

📄 许可证

请遵守此处和此处描述的许可条款。

信息表格

属性	详情
数据集	Wild - Heart/Disney - VideoGeneration - Dataset
基础模型	THUDM/CogVideoX - 5b
任务类型	文本到视频
库名称	diffusers
标签	text - to - video、diffusers - training、diffusers、lora、cogvideox、cogvideox - diffusers