Text2Video-Zero Model Card - ControlNet Canny Avatar Style
Text2Video-Zero is a zero-shot text-to-video generator. It supports zero-shot text-to-video generation, Video Instruct Pix2Pix (instruction-guided video editing), text and pose conditional video generation, text and canny-edge conditional video generation, and text, canny-edge and DreamBooth conditional video generation. For more details, see our paper and our demo. Our code is compatible with any Stable Diffusion base model. This model provides DreamBooth weights for the Avatar style, for use with edge guidance (via ControlNet) in Text2Video-Zero.
Quick Start
This section provides an overview of the Text2Video-Zero model and its capabilities. The model can generate videos from text prompts and perform multiple video-related tasks.
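The following is a minimal sketch of edge-guided video generation with the diffusers library, using the cross-frame attention processor that diffusers ships for Text2Video-Zero. The repository ids `PAIR/text2video-zero-controlnet-canny-avatar` and `lllyasviel/sd-controlnet-canny` are assumptions that this card does not state, so substitute the ids you actually use. The helper names `avatar_prompt` and `generate_frames` are illustrative only, and a CUDA GPU is assumed.

```python
def avatar_prompt(description, keyword="avatar style"):
    """Append the DreamBooth keyword unless the description already contains it."""
    description = description.strip()
    if keyword.lower() in description.lower():
        return description
    return f"{description} {keyword}"


def generate_frames(
    edge_frames,
    description,
    model_id="PAIR/text2video-zero-controlnet-canny-avatar",  # assumed repository id
    controlnet_id="lllyasviel/sd-controlnet-canny",  # assumed repository id
    device="cuda",
):
    """Generate one stylized frame per Canny edge map (a list of same-sized PIL images)."""
    # Heavy dependencies are imported here so avatar_prompt works without them.
    import torch
    from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
    from diffusers.pipelines.text_to_video_synthesis.pipeline_text_to_video_zero import (
        CrossFrameAttnProcessor,
    )

    controlnet = ControlNetModel.from_pretrained(controlnet_id, torch_dtype=torch.float16)
    pipe = StableDiffusionControlNetPipeline.from_pretrained(
        model_id, controlnet=controlnet, torch_dtype=torch.float16
    ).to(device)

    # Cross-frame attention lets every frame attend to the first frame,
    # which keeps appearance consistent across the video.
    pipe.unet.set_attn_processor(CrossFrameAttnProcessor(batch_size=2))
    pipe.controlnet.set_attn_processor(CrossFrameAttnProcessor(batch_size=2))

    # Share one initial latent across all frames. Stable Diffusion latents are
    # 8x smaller than the image, so width and height must be divisible by 8.
    num_frames = len(edge_frames)
    width, height = edge_frames[0].size
    latents = torch.randn(
        (1, 4, height // 8, width // 8), device=device, dtype=torch.float16
    ).repeat(num_frames, 1, 1, 1)

    prompt = avatar_prompt(description)
    result = pipe(prompt=[prompt] * num_frames, image=edge_frames, latents=latents)
    return result.images
```

Calling `generate_frames(edge_maps, "oil painting of a beautiful girl")` would return a list of PIL images, one per edge map, which can then be written out as a video with a tool of your choice.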
Features
- Zero-shot Text-to-Video Generation: Generate videos directly from text prompts without extensive fine-tuning.
- Video Editing: Perform instruction-guided video editing with Video Instruct Pix2Pix.
- Conditional Video Generation: Generate videos based on text, pose, canny-edge, and DreamBooth conditions.
Documentation
Weights for Text2Video-Zero
We converted the original weights into the diffusers format and made them usable for ControlNet with edge guidance, following https://github.com/lllyasviel/ControlNet/discussions/12.
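Edge guidance expects one Canny edge map per video frame. The sketch below shows one way to produce them, assuming OpenCV, NumPy and Pillow are installed. The function name `canny_edge_maps` and the default thresholds of 100 and 200 are illustrative choices, not values taken from this card.

```python
def canny_edge_maps(frames, low=100, high=200):
    """Turn RGB frames (PIL images or HxWx3 uint8 arrays) into 3-channel Canny edge images."""
    # Validate the hysteresis thresholds before loading any heavy dependency.
    if not 0 <= low < high:
        raise ValueError("expected 0 <= low < high")

    import cv2
    import numpy as np
    from PIL import Image

    edge_maps = []
    for frame in frames:
        if isinstance(frame, Image.Image):
            frame = frame.convert("RGB")
        rgb = np.asarray(frame, dtype=np.uint8)
        gray = cv2.cvtColor(rgb, cv2.COLOR_RGB2GRAY)
        edges = cv2.Canny(gray, low, high)  # single-channel uint8 map, 0 or 255
        # ControlNet conditioning images are RGB, so repeat the edge channel.
        edge_maps.append(Image.fromarray(np.stack([edges] * 3, axis=-1)))
    return edge_maps
```

Lower thresholds keep more fine detail in the edge map, while higher ones keep only strong contours; which works better depends on the source video.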
Model Details
| Property | Details |
|---|---|
| Developed by | Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan and Humphrey Shi |
| Model Type | DreamBooth text-to-image and text-to-video generation model with edge control for Text2Video-Zero |
| Language(s) | English |
| License | The CreativeML OpenRAIL-M license |
| Model Description | This is a model for Text2Video-Zero with edge guidance and Avatar style. It can also be used with ControlNet in a text-to-image setup with edge guidance. |
| DreamBooth Keyword | avatar style |
| Resources for more information | GitHub, Paper, CIVITAI |
Original Weights
The DreamBooth weights for the Avatar style were taken from CIVITAI.
Model Details
| Property | Details |
|---|---|
| Developed by | Quiet_Joker (username listed on CIVITAI) |
| Model Type | DreamBooth text-to-image generation model |
| Language(s) | English |
| License | The CreativeML OpenRAIL-M license |
| Model Description | This model was created with DreamBooth to generate images in the Avatar style from text prompts. |
| DreamBooth Keyword | avatar style |
| Resources for more information | CIVITAI |
Biases content acknowledgement
Important Note
Be aware that Text2Video-Zero may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. Text2Video-Zero in this demo is for research purposes only.
License
The model is released under the CreativeML OpenRAIL-M license.
Citation
@article{text2video-zero,
title={Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators},
author={Khachatryan, Levon and Movsisyan, Andranik and Tadevosyan, Vahram and Henschel, Roberto and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey},
journal={arXiv preprint arXiv:2303.13439},
year={2023}
}