đ Text2Video-Zero Model Card - ControlNet Canny Anime Style
Text2Video-Zero is a zero-shot text-to-video generator. It can perform various tasks such as zero-shot text-to-video generation, Video Instruct Pix2Pix (instruction-guided video editing), text and pose conditional video generation, text and canny-edge conditional video generation, and text, canny-edge and dreambooth conditional video generation. This model provides DreamBooth weights for the Anime style
to be used with edge guidance in text2video zero.
đ Quick Start
Text2Video-Zero is a powerful zero-shot text-to-video generator. It can perform multiple tasks, including zero-shot text-to-video generation
, Video Instruct Pix2Pix
(instruction-guided video editing), text and pose conditional video generation
, text and canny-edge conditional video generation
, and text, canny-edge and dreambooth conditional video generation
. For more information, please refer to our paper and our demo:
. Our code is compatible with any StableDiffusion base model.
This model offers DreamBooth weights for the Anime style
to be used with edge guidance (using ControlNet) in text2video zero.
⨠Features
- Multiple Generation Modes: Supports zero-shot text-to-video generation, instruction-guided video editing, and conditional video generation based on text, pose, canny-edge, and dreambooth.
- Anime Style Support: Provides DreamBooth weights for the Anime style with edge guidance.
- Compatibility: Works with any StableDiffusion base model.
đ Documentation
Weights for Text2Video-Zero
We converted the original weights into diffusers and made them usable for ControlNet with edge guidance using: https://github.com/lllyasviel/ControlNet/discussions/12.
Model Details
Property |
Details |
Developed by |
Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan and Humphrey Shi |
Model Type |
Dreambooth text-to-image and text-to-video generation model with edge control for text2video zero |
Language(s) |
English |
License |
The CreativeML OpenRAIL M license |
Model Description |
This is a model for text2video zero with edge guidance and anime style. It can also be used with ControlNet in a text-to-image setup with edge guidance. |
DreamBoth Keyword |
anime style |
Resources for more information |
GitHub, Paper, CIVITAI |
Original Weights
The Dreambooth weights for the Anime style were taken from CIVITAI.
Model Details
Property |
Details |
Developed by |
Quiet_Joker (Username listed on CIVITAI) |
Model Type |
Dreambooth text-to-image generation model |
Language(s) |
English |
License |
The CreativeML OpenRAIL M license |
Model Description |
This is a model that was created using DreamBooth to generate images with Anime style, based on text prompts. |
DreamBoth Keyword |
anime style |
Resources for more information |
CIVITAI |
Biases content acknowledgement
â ī¸ Important Note
Beware that Text2Video-Zero may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. Text2Video-Zero in this demo is meant only for research purposes.
đ License
This model is licensed under The CreativeML OpenRAIL M license.
đ Citation
@article{text2video-zero,
title={Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators},
author={Khachatryan, Levon and Movsisyan, Andranik and Tadevosyan, Vahram and Henschel, Roberto and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey},
journal={arXiv preprint arXiv:2303.13439},
year={2023}
}