text2video-zero-controlnet-canny-anime Open Source Model - Generate Anime-style Text-to-Video for Free, Supporting Edge Control

Text2video Zero Controlnet Canny Anime

Developed by PAIR

A zero-shot text-to-video generator based on Text2Video-Zero, optimized for anime style with edge-guided control support

Text-to-Video Open Source License:Openrail #Zero-shot video generation #Anime style control #Edge-guided generation

Downloads 79

Release Time : 3/25/2023

Model Overview

This model combines DreamBooth fine-tuned weights with ControlNet edge detection technology to achieve anime-style text-to-video generation and editing

Model Features

Zero-shot video generation

Generate videos directly from text without additional training

Anime style optimization

Uses DreamBooth fine-tuned weights specifically optimized for anime style output

Edge-guided control

Conditional control based on edge detection through ControlNet

Multi-modal input

Supports combined inputs like text + pose/edge detection

Model Capabilities

Text-to-video generation

Video instruction editing

Pose-conditioned generation

Edge detection conditioned generation

Anime style generation

Use Cases

Creative content generation

Anime short video creation

Automatically generate anime-style short videos from text descriptions

Can produce coherent 10-30 second animation clips

Video style conversion

Convert live-action videos to anime style

Maintains original motion while transforming visual style

Film pre-production

Animation storyboard generation

Quickly generate animation storyboard drafts

Accelerates pre-production workflow

🚀 Text2Video-Zero Model Card - ControlNet Canny Anime Style

Text2Video-Zero is a zero-shot text-to-video generator. It can perform various tasks such as zero-shot text-to-video generation, Video Instruct Pix2Pix (instruction-guided video editing), text and pose conditional video generation, text and canny-edge conditional video generation, and text, canny-edge and dreambooth conditional video generation. This model provides DreamBooth weights for the Anime style to be used with edge guidance in text2video zero.

🚀 Quick Start

Text2Video-Zero is a powerful zero-shot text-to-video generator. It can perform multiple tasks, including zero-shot text-to-video generation, Video Instruct Pix2Pix (instruction-guided video editing), text and pose conditional video generation, text and canny-edge conditional video generation, and text, canny-edge and dreambooth conditional video generation. For more information, please refer to our paper and our demo: . Our code is compatible with any StableDiffusion base model.

This model offers DreamBooth weights for the Anime style to be used with edge guidance (using ControlNet) in text2video zero.

✨ Features

Multiple Generation Modes: Supports zero-shot text-to-video generation, instruction-guided video editing, and conditional video generation based on text, pose, canny-edge, and dreambooth.
Anime Style Support: Provides DreamBooth weights for the Anime style with edge guidance.
Compatibility: Works with any StableDiffusion base model.

📚 Documentation

Weights for Text2Video-Zero

We converted the original weights into diffusers and made them usable for ControlNet with edge guidance using: https://github.com/lllyasviel/ControlNet/discussions/12.

Model Details

Property	Details
Developed by	Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan and Humphrey Shi
Model Type	Dreambooth text-to-image and text-to-video generation model with edge control for text2video zero
Language(s)	English
License	The CreativeML OpenRAIL M license
Model Description	This is a model for text2video zero with edge guidance and anime style. It can also be used with ControlNet in a text-to-image setup with edge guidance.
DreamBoth Keyword	anime style
Resources for more information	GitHub, Paper, CIVITAI

Original Weights

The Dreambooth weights for the Anime style were taken from CIVITAI.

Model Details

Property	Details
Developed by	Quiet_Joker (Username listed on CIVITAI)
Model Type	Dreambooth text-to-image generation model
Language(s)	English
License	The CreativeML OpenRAIL M license
Model Description	This is a model that was created using DreamBooth to generate images with Anime style, based on text prompts.
DreamBoth Keyword	anime style
Resources for more information	CIVITAI

Biases content acknowledgement

⚠️ Important Note

Beware that Text2Video-Zero may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. Text2Video-Zero in this demo is meant only for research purposes.

📄 License

This model is licensed under The CreativeML OpenRAIL M license.

📖 Citation

@article{text2video-zero,
  title={Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators},
  author={Khachatryan, Levon and Movsisyan, Andranik and Tadevosyan, Vahram and Henschel, Roberto and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey},
  journal={arXiv preprint arXiv:2303.13439},
  year={2023}
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご