Text2Video-Zero Model Card - ControlNet Canny Arcane Style
Text2Video-Zero is a zero-shot text-to-video generator. It supports several tasks: zero-shot text-to-video generation; Video Instruct Pix2Pix (instruction-guided video editing); text and pose conditional video generation; text and canny-edge conditional video generation; and text, canny-edge, and DreamBooth conditional video generation. This model provides DreamBooth weights for the Arcane style, to be used with edge guidance in Text2Video-Zero.
Quick Start
Text2Video-Zero can perform various video generation and editing tasks. Our code is compatible with any Stable Diffusion base model. For more information, please refer to our paper and our demo.
Features
- Multiple Generation Modes: Capable of zero-shot text-to-video generation, instruction-guided video editing, and conditional video generation based on text, pose, canny-edge, etc.
- Arcane Style Support: Provides DreamBooth weights for the Arcane style with edge guidance.
- Compatibility: Works with any StableDiffusion base model.
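Canny-edge conditioning needs one edge map per video frame. The sketch below shows one possible way to produce such maps; it is not taken from the Text2Video-Zero code. It assumes that `opencv-python` and `Pillow` are installed, that a 512x512 resolution suits the base model, and that the Canny thresholds 100 and 200 are reasonable starting values. The helper names `center_square_box` and `video_to_canny_edges` are hypothetical.

```python
from typing import List, Tuple


def center_square_box(width: int, height: int) -> Tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of the largest centered square crop."""
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return left, top, left + side, top + side


def video_to_canny_edges(
    path: str, size: int = 512, low: int = 100, high: int = 200
) -> List:
    """Read a video and return one RGB canny edge image (PIL) per frame.

    The size and threshold defaults are assumptions, not values from the card.
    """
    # Imported lazily so that center_square_box works without these packages.
    import cv2
    from PIL import Image

    capture = cv2.VideoCapture(path)
    edges = []
    while True:
        ok, frame = capture.read()
        if not ok:
            break  # end of video or unreadable file
        height, width = frame.shape[:2]
        left, top, right, bottom = center_square_box(width, height)
        # Crop to a square first so that resizing does not distort the frame.
        square = cv2.resize(frame[top:bottom, left:right], (size, size))
        gray = cv2.cvtColor(square, cv2.COLOR_BGR2GRAY)
        edge = cv2.Canny(gray, low, high)
        # Convert the single-channel edge map to a 3-channel image.
        edges.append(Image.fromarray(edge).convert("RGB"))
    capture.release()
    return edges
```

Lower thresholds keep more fine detail in the edge maps, while higher thresholds keep only strong contours; which works better depends on the source video.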
Documentation
Weights for Text2Video-Zero
We converted the original weights to the diffusers format and made them usable for ControlNet with edge guidance, following https://github.com/lllyasviel/ControlNet/discussions/12.
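Because the weights are in diffusers format, they can in principle be loaded with the ControlNet pipeline of the `diffusers` library. The following is a minimal sketch, not the authors' reference code. It assumes that `PAIR/text2video-zero-controlnet-canny-arcane` is the Hub id of this model and that `lllyasviel/sd-controlnet-canny` is a suitable canny ControlNet (neither id is stated in this card), that `torch` and `diffusers` are installed, that a CUDA GPU is available, and that every edge map is a 512x512 image. The helper names `frame_prompts` and `generate_frames` are hypothetical.

```python
from typing import List, Sequence

# Assumption: Hub repository ids; neither is stated in this model card.
MODEL_ID = "PAIR/text2video-zero-controlnet-canny-arcane"
CONTROLNET_ID = "lllyasviel/sd-controlnet-canny"


def frame_prompts(prompt: str, num_frames: int) -> List[str]:
    """Repeat one text prompt so that every video frame receives it."""
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    return [prompt] * num_frames


def generate_frames(prompt: str, canny_edges: Sequence, device: str = "cuda") -> list:
    """Generate one image per edge map; canny_edges holds 512x512 PIL images."""
    # Heavy dependencies are imported lazily so frame_prompts works without them.
    import torch
    from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
    from diffusers.pipelines.text_to_video_synthesis.pipeline_text_to_video_zero import (
        CrossFrameAttnProcessor,
    )

    controlnet = ControlNetModel.from_pretrained(
        CONTROLNET_ID, torch_dtype=torch.float16
    )
    pipe = StableDiffusionControlNetPipeline.from_pretrained(
        MODEL_ID, controlnet=controlnet, torch_dtype=torch.float16
    ).to(device)

    # Cross-frame attention ties each frame to the first one for consistency.
    pipe.unet.set_attn_processor(CrossFrameAttnProcessor(batch_size=2))
    pipe.controlnet.set_attn_processor(CrossFrameAttnProcessor(batch_size=2))

    # Use the same initial latent for all frames (64x64 latents for 512x512 images).
    num_frames = len(canny_edges)
    latents = torch.randn(
        (1, 4, 64, 64), device=device, dtype=torch.float16
    ).repeat(num_frames, 1, 1, 1)

    result = pipe(
        prompt=frame_prompts(prompt, num_frames),
        image=list(canny_edges),
        latents=latents,
    )
    return result.images
```

A call might look like `generate_frames("a girl dancing, arcane style", edges)`, where `edges` holds one canny edge image per frame. Long videos may need to be processed in smaller batches of frames to fit into GPU memory.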
Model Details
| Property | Details |
| --- | --- |
| Developed by | Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan and Humphrey Shi |
| Model Type | DreamBooth text-to-image and text-to-video generation model with edge control for Text2Video-Zero |
| Language(s) | English |
| License | The CreativeML OpenRAIL M license. |
| Model Description | This is a model for Text2Video-Zero with edge guidance and Arcane style. It can also be used with ControlNet in a text-to-image setup with edge guidance. |
| DreamBooth Keyword | arcane style |
| Resources for more information | GitHub, Paper, CIVITAI. |
| Cite as | @article{text2video-zero, title={Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators}, author={Khachatryan, Levon and Movsisyan, Andranik and Tadevosyan, Vahram and Henschel, Roberto and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey}, journal={arXiv preprint arXiv:2303.13439}, year={2023} } |
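DreamBooth models are typically steered by including their keyword in the text prompt. As a small illustration, the sketch below appends the keyword `arcane style` listed above to a prompt that does not yet contain it. The helper name `with_keyword` is hypothetical, and placing the keyword at the end of the prompt is an assumption; the card does not prescribe a position.

```python
# DreamBooth keyword as listed in the model details.
KEYWORD = "arcane style"


def with_keyword(prompt: str, keyword: str = KEYWORD) -> str:
    """Append the DreamBooth keyword unless the prompt already contains it."""
    # Drop surrounding whitespace and a trailing comma before appending.
    cleaned = prompt.strip().rstrip(",").strip()
    if keyword.lower() in cleaned.lower():
        return cleaned
    return f"{cleaned}, {keyword}" if cleaned else keyword


print(with_keyword("a girl dancing"))
```

The check is case-insensitive, so a prompt that already mentions "Arcane Style" is returned without a second copy of the keyword.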
Original Weights
The DreamBooth weights for the Arcane style were taken from CIVITAI.
Model Details
| Property | Details |
| --- | --- |
| Developed by | Quiet_Joker (Username listed on CIVITAI) |
| Model Type | DreamBooth text-to-image generation model |
| Language(s) | English |
| License | The CreativeML OpenRAIL M license. |
| Model Description | This is a model that was created using DreamBooth to generate images in the Arcane style, based on text prompts. |
| DreamBooth Keyword | arcane style |
| Resources for more information | CIVITAI. |
Biases content acknowledgement
Important Note
Beware that Text2Video-Zero may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. Text2Video-Zero in this demo is meant only for research purposes.
License
The model is licensed under The CreativeML OpenRAIL M license.
Citation
@article{text2video-zero,
title={Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators},
author={Khachatryan, Levon and Movsisyan, Andranik and Tadevosyan, Vahram and Henschel, Roberto and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey},
journal={arXiv preprint arXiv:2303.13439},
year={2023}
}