text2video-zero-controlnet-canny-avatar Open Source Model - Effortlessly Achieve Text-to-Video Generation and Edge Condition Control


Developed by PAIR

Text2Video-Zero is a zero-shot text-to-video generator that combines ControlNet's edge guidance with Avatar-style DreamBooth weights to enable text-to-video generation and edge condition control.

Text-to-Video | Open Source License: OpenRAIL | #Zero-shot Video Generation #Edge-guided Generation #Avatar Style

Downloads 126

Release Time: 3/27/2023

Model Overview

The Text2Video-Zero framework supports zero-shot text-to-video generation, instruction-guided video editing, text- and pose-conditioned video generation, and text- and Canny edge-conditioned video generation. This checkpoint supplies Avatar-style DreamBooth weights for the edge-conditioned setup, which makes it particularly suitable for Avatar-style video creation.

Model Features

Zero-shot Video Generation

Generate videos from text without additional training

Edge Condition Control

Achieve Canny edge-guided video generation through ControlNet

Avatar Style

Integrated DreamBooth weights support Avatar-style video creation

Versatile Applications

Supports multiple functions including text-to-video, video editing, and pose-conditioned generation

Model Capabilities

Text-to-video generation

Edge-guided video generation

Video editing

Pose-conditioned video generation

Stylized video creation

Use Cases

Creative Content Production

Avatar-style Video Creation

Generate short Avatar-style videos based on text prompts

Stylistically consistent animated videos

Edge-guided Video Generation

Use Canny edge maps to control video content structure

Structurally precise video output

Video Editing

Instruction-guided Video Editing

Edit existing video content through text instructions

Modified videos maintain original style

🚀 Text2Video-Zero Model Card - ControlNet Canny Avatar Style

Text2Video-Zero is a zero-shot text-to-video generator. It can perform various tasks such as zero-shot text-to-video generation, Video Instruct Pix2Pix (instruction-guided video editing), text- and pose-conditional video generation, text- and Canny-edge-conditional video generation, and text-, Canny-edge- and DreamBooth-conditional video generation. For more details, see our paper and our demo. Our code is compatible with any Stable Diffusion base model. This model offers DreamBooth weights for the Avatar style, for use with edge guidance (via ControlNet) in Text2Video-Zero.

🚀 Quick Start

This section provides an overview of the Text2Video-Zero model and its capabilities. The model can generate videos from text prompts and perform multiple video-related tasks.

✨ Features

Zero-shot Text-to-Video Generation: Generate videos directly from text prompts without additional training.
Video Editing: Perform instruction-guided video editing with Video Instruct Pix2Pix.
Conditional Video Generation: Generate videos based on text, pose, Canny-edge, and DreamBooth conditions.
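
To make the Canny-edge condition concrete: edge-conditioned generation takes one edge map per video frame. The sketch below is illustrative only and is not the project's code. It uses a simplified Sobel gradient threshold as a stand-in for the full Canny detector (which additionally smooths the image, thins edges, and applies hysteresis). The function names `sobel_edge_map` and `video_edge_maps` and the default threshold are hypothetical.

```python
def sobel_edge_map(frame, threshold=100):
    """Return a binary edge map (values 0 or 255) for one grayscale frame.

    `frame` is a list of rows of intensities in [0, 255].
    Simplified stand-in for Canny: Sobel gradient magnitude + fixed threshold.
    Border pixels are left at 0 because the 3x3 kernel does not fit there.
    """
    h, w = len(frame), len(frame[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient (right column minus left column).
            gx = (frame[y - 1][x + 1] + 2 * frame[y][x + 1] + frame[y + 1][x + 1]
                  - frame[y - 1][x - 1] - 2 * frame[y][x - 1] - frame[y + 1][x - 1])
            # Vertical gradient (bottom row minus top row).
            gy = (frame[y + 1][x - 1] + 2 * frame[y + 1][x] + frame[y + 1][x + 1]
                  - frame[y - 1][x - 1] - 2 * frame[y - 1][x] - frame[y - 1][x + 1])
            if (gx * gx + gy * gy) ** 0.5 >= threshold:
                edges[y][x] = 255
    return edges


def video_edge_maps(frames, threshold=100):
    """Compute one edge map per frame, preserving frame order."""
    return [sobel_edge_map(frame, threshold) for frame in frames]
```

In practice a library Canny implementation would normally replace `sobel_edge_map`; the point here is only that the condition is a per-frame structural map, independent of the text prompt.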

📚 Documentation

Weights for Text2Video-Zero

We converted the original weights into the diffusers format and made them usable with ControlNet edge guidance by following: https://github.com/lllyasviel/ControlNet/discussions/12.
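
As a rough illustration of how converted weights like these might be used from the diffusers library, here is a minimal sketch. Several things are assumptions rather than facts stated in this card: the Hub repository IDs in `MODEL_ID` and `CONTROLNET_ID`, the 512x512 frame size implied by the 64x64 latents, and the helper name `generate_frames` (hypothetical). Heavy imports sit inside the function so the module can be loaded without a GPU; running the function requires `torch`, `diffusers`, and downloaded weights.

```python
# Assumed Hugging Face Hub IDs; verify against the actual repositories.
MODEL_ID = "PAIR/text2video-zero-controlnet-canny-avatar"
CONTROLNET_ID = "lllyasviel/sd-controlnet-canny"


def generate_frames(canny_edges, prompt, device="cuda"):
    """Generate one image per edge map; `canny_edges` is a list of PIL images.

    Sketch only: assumes 512x512 edge maps and a CUDA-capable device.
    """
    import torch
    from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
    from diffusers.pipelines.text_to_video_synthesis.pipeline_text_to_video_zero import (
        CrossFrameAttnProcessor,
    )

    controlnet = ControlNetModel.from_pretrained(CONTROLNET_ID, torch_dtype=torch.float16)
    pipe = StableDiffusionControlNetPipeline.from_pretrained(
        MODEL_ID, controlnet=controlnet, torch_dtype=torch.float16
    ).to(device)

    # Cross-frame attention ties every frame to the first one for consistency.
    # batch_size=2 accounts for the conditional and unconditional passes.
    pipe.unet.set_attn_processor(CrossFrameAttnProcessor(batch_size=2))
    pipe.controlnet.set_attn_processor(CrossFrameAttnProcessor(batch_size=2))

    # Share one initial latent across all frames to reduce flicker.
    num_frames = len(canny_edges)
    latents = torch.randn((1, 4, 64, 64), device=device, dtype=torch.float16)
    latents = latents.repeat(num_frames, 1, 1, 1)

    result = pipe(prompt=[prompt] * num_frames, image=canny_edges, latents=latents)
    return result.images
```

A call might look like `frames = generate_frames(edge_images, "oil painting of a beautiful girl avatar style")`, where `edge_images` is a list of per-frame Canny edge maps prepared beforehand.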

Model Details

Property	Details
Developed by	Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan and Humphrey Shi
Model Type	DreamBooth text-to-image and text-to-video generation model with edge control for Text2Video-Zero
Language(s)	English
License	The CreativeML OpenRAIL-M license
Model Description	This is a model for Text2Video-Zero with edge guidance and Avatar style. It can also be used with ControlNet in a text-to-image setup with edge guidance.
DreamBooth Keyword	avatar style
Resources for more information	GitHub, Paper, CIVITAI
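
The DreamBooth keyword listed above is the phrase that triggers the learned style, so it normally needs to appear in the prompt. The following is a small sketch of one way to handle that, not code from the project: the helper name `build_frame_prompts` is hypothetical, and repeating the prompt once per frame is an assumption about how a batched image pipeline would be called.

```python
DREAMBOOTH_KEYWORD = "avatar style"  # keyword taken from the table above


def build_frame_prompts(prompt, num_frames, keyword=DREAMBOOTH_KEYWORD):
    """Return `num_frames` identical prompts, each containing the keyword.

    The keyword is appended only if the prompt does not already include it
    (case-insensitive check).
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    text = prompt.strip()
    if keyword.lower() not in text.lower():
        text = f"{text} {keyword}".strip()
    return [text] * num_frames
```

For example, `build_frame_prompts("oil painting of a beautiful girl", 8)` yields eight copies of `"oil painting of a beautiful girl avatar style"`.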

Original Weights

The DreamBooth weights for the Avatar style were taken from CIVITAI.

Model Details

Property	Details
Developed by	Quiet_Joker (Username listed on CIVITAI)
Model Type	DreamBooth text-to-image generation model
Language(s)	English
License	The CreativeML OpenRAIL-M license
Model Description	This is a model that was created using DreamBooth to generate images with avatar style, based on text prompts.
DreamBooth Keyword	avatar style
Resources for more information	CIVITAI

Biases content acknowledgement

⚠️ Important Note

Be aware that Text2Video-Zero may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. Text2Video-Zero in this demo is for research purposes only.

📄 License

This model is released under the CreativeML OpenRAIL-M license.

📚 Citation

@article{text2video-zero,
  title={Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators},
  author={Khachatryan, Levon and Movsisyan, Andranik and Tadevosyan, Vahram and Henschel, Roberto and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey},
  journal={arXiv preprint arXiv:2303.13439},
  year={2023}
}
