T

Text2motion

Developed by Quantamhash
An open and advanced large-scale video generation model suite supporting multiple tasks including text-to-video and image-to-video generation
Downloads 233
Release Time : 3/21/2025

Model Overview

Text-to-Motion is a comprehensive open-source video foundation model suite that pushes the boundaries of video generation, supporting bilingual text input (Chinese/English) and dual resolutions (480P/720P)

Model Features

State-of-the-art performance
Outperforms existing open-source models and commercial solutions across multiple benchmarks
Consumer GPU support
T2V-1.3B model requires only 8.19GB VRAM, generating 5-second 480P video in ~4 minutes on RTX 4090
Multi-task capability
Supports various tasks including text-to-video, image-to-video, and video editing
Bilingual text generation
First video generation model supporting both Chinese and English text input
Efficient video VAE
Maintains temporal information when encoding/decoding arbitrary-length 1080P videos with optimal efficiency and performance

Model Capabilities

Text-to-video
Image-to-video
Video editing
Text-to-image
Video-to-audio

Use Cases

Entertainment content creation
Animated short generation
Generate anthropomorphic animal animations from text descriptions
Example: Generate 480P/720P video of two anthropomorphic cats boxing
Advertisement production
Product showcase videos
Automatically generate product demonstration videos from descriptions
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase