W

Wan2.1 T2V 1.3B Diffusers

Developed by Wan-AI
Wan 2.1 is a comprehensive open-source video foundation model featuring top-tier performance, consumer-grade GPU support, multi-task capabilities, visual-text generation, and efficient video VAE.
Downloads 45.29k
Release Time : 3/1/2025

Model Overview

Wan 2.1 is an open and advanced large-scale video generation model designed to push the boundaries of video generation. It supports tasks such as text-to-video, image-to-video, video editing, text-to-image, and video-to-audio.

Model Features

Top-tier performance
Consistently outperforms existing open-source models and commercial solutions in multiple benchmarks.
Consumer-grade GPU support
The T2V-1.3B model requires only 8.19GB of VRAM and is compatible with almost all consumer-grade GPUs.
Multi-task support
Excels in text-to-video, image-to-video, video editing, text-to-image, and video-to-audio, driving advancements in video generation.
Visual-text generation
The first video model supporting bilingual (English and Chinese) text generation, with powerful text generation capabilities significantly enhancing practical value.
Efficient video VAE
Wan-VAE maintains temporal information when encoding and decoding arbitrary-length 1080P videos, providing an ideal foundation for video and image generation.

Model Capabilities

Text-to-video
Image-to-video
Video editing
Text-to-image
Video-to-audio

Use Cases

Creative video production
Animated short film generation
Generate animated short films with anthropomorphic characters using text descriptions.
Generating a 5-second 480P video takes approximately 4 minutes (RTX 4090)
Video editing
Video style transfer
Transform existing videos into different styles.
Featured Recommended AI Models
ยฉ 2025AIbase