W

Wan2.1 VACE 14B

Developed by Wan-AI
Wan2.1 is a comprehensive and open video foundation model designed to push the boundaries of video generation, supporting various video generation and editing tasks.
Downloads 8,797
Release Time : 5/13/2025

Model Overview

Wan2.1 is an advanced video generation model with multi-task support including text-to-video, image-to-video, video editing, text-to-image, and video-to-audio, driving progress in the field of video generation.

Model Features

SOTA performance
Consistently surpasses existing open-source models and state-of-the-art commercial solutions in multiple benchmarks.
Consumer-grade GPU support
The T2V-1.3B model requires only 8.19GB of VRAM and is compatible with almost all consumer-grade GPUs.
Multi-task support
Excels in text-to-video, image-to-video, video editing, text-to-image, and video-to-audio tasks.
Visual text generation
The first video model capable of generating bilingual text in Chinese and English, with strong text generation capabilities.
Efficient video VAE
Wan-VAE maintains temporal information when encoding and decoding 1080P videos of arbitrary lengths.

Model Capabilities

Text-to-video generation
Image-to-video generation
Video editing
Text-to-image generation
Video-to-audio generation
Chinese-English bilingual text generation

Use Cases

Video creation
Short video generation
Generate short video content based on text descriptions.
Generating a 5-second 480P video takes approximately 4 minutes (RTX 4090).
Video editing
Video style transfer
Modify video style based on reference images or text.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase