# Text-to-Video Generation

Skyreels V2 T2V 14B 720P VACE GGUF
Other
SkyReels-V2 is a 14B-parameter text-to-video generation model that supports 720P resolution output and includes VACE functionality.
Text-to-Video English
QuantStack
146
6
Wan2.1 VACE 1.3B GGUF
Apache-2.0
A direct GGUF conversion of Wan2.1-VACE-1.3B, an open-source video foundation model that runs on consumer-grade GPUs and handles a range of video generation tasks.
Text-to-Video English
samuelchristlie
561
0
Moviigen1.1 VACE GGUF
Apache-2.0
An experimental GGUF conversion of ZuluVision/MoviiGen1.1, integrated with the VACE plugin for text-to-video tasks.
Text-to-Video
QuantStack
1,222
7
Wan2.1 T2V 1.3B GGUF
Apache-2.0
A direct GGUF conversion of Wan2.1-T2V-1.3B, suitable for text-to-video generation on consumer-grade GPUs.
Text-to-Video English
samuelchristlie
155
0
Wan2.1 VACE 14B GGUF
Apache-2.0
A quantized GGUF conversion of the Wan-AI/Wan2.1-VACE-14B model, primarily designed for text-to-video generation.
Text-to-Video
QuantStack
2,331
23
Moviigen1.1 GGUF
Apache-2.0
A GGUF conversion of MoviiGen1.1, a video generation model that supports text-to-video tasks.
Video Processing
wsbagnsv1
3,522
18
Ltxv 13b 0.9.7 Distilled GGUF
Other
LTX-Video is a text-to-video generation model that supports creating video content from text or images.
Text-to-Video English
wsbagnsv1
6,208
19
Wan2.1 T2V 14B CausVid GGUF
Apache-2.0
A GGUF conversion of the Wan-AI/Wan2.1-T2V-14B model, primarily used for text-to-video generation.
Text-to-Video English
Njbx
190
0
Ltxv 13b 0.9.7 Dev GGUF
Other
GGUF quantized version of the 13b-0.9.7-dev variant of Lightricks/LTX-Video, supporting text-to-video and image-to-video generation tasks.
Text-to-Video English
wsbagnsv1
25.99k
61
Ltxv0.9.6 Gguf
Other
GGUF quantized versions of the Lightricks/LTX-Video model, including development and distilled editions, designed for text-to-video generation tasks.
Text-to-Video English
calcuis
1,753
5
Skyreels V2 T2V 14B 540P GGUF
Other
SkyReels-V2 is a 14B-parameter text-to-video generation model that supports 540P resolution video generation.
Video Processing
wsbagnsv1
205
1
Wan2.1 Fun 14B Control Gguf
Apache-2.0
A 14B-parameter multimodal model released by Alibaba PAI, supporting text-to-video generation tasks.
Text-to-Video Supports Multiple Languages
city96
5,120
10
Wan2.1 Fun 14B InP Gguf
Apache-2.0
A 14B-parameter multimodal model released by Alibaba PAI, supporting text-to-video generation tasks.
Text-to-Video Supports Multiple Languages
city96
13.97k
18
Wan 1.3b Gguf
Apache-2.0
This is a GGUF quantized version based on Wan-AI/Wan2.1-T2V-1.3B, specifically designed for text-to-video generation tasks, compatible with comfyui-gguf and gguf nodes.
Text-to-Video English
calcuis
3,058
12
Ltxv0.9.5 Gguf
Other
LTX-Video is a text-to-video model that generates video content from input text descriptions.
Text-to-Video English
calcuis
337
5
Mochi Gguf
Apache-2.0
A GGUF quantized version of Mochi, a text-to-video generation model. It includes a GGUF encoder and a GGUF variational autoencoder and is suited to fast video generation.
Text-to-Video English
calcuis
284
2
Wan Gguf
Apache-2.0
A GGUF quantized version of Wan Video, a text-to-video generation model. The GGUF files allow efficient inference on older or low-end machines.
Text-to-Video English
calcuis
26.46k
66
Wan2.1 T2V 14B Gguf
Apache-2.0
A text-to-video generation model converted to GGUF format, usable through the ComfyUI-GGUF custom nodes.
Text-to-Video
city96
42.38k
130
Skyreels V1 Hunyuan I2V HFIE
Other
SkyReels-V1-Hunyuan-I2V is a video generation model developed by SkyworkAI, based on Tencent's Hunyuan architecture, supporting video content generation from text input.
Text-to-Video English
jbilcke-hf
21
4
Animatelcm
AnimateLCM is a diffusion model-based text-to-video generation system capable of producing high-quality short video clips from text descriptions.
Text-to-Video
chaowenguo
574
0
Mochi 1 Transformer 42
Apache-2.0
A distilled version of the genmoai mochi-1 transformer with 42 modules (the original has 48), made lighter by iteratively removing the modules with the smallest MSE values.
Text-to-Video English
NimVideo
62
3
Fasthunyuan Gguf
Other
The GGUF quantized version of FastHunyuan, designed for text-to-video generation tasks. It must be used with ComfyUI.
Text-to-Video
city96
2,564
45
Hunyuanvideo HFIE
Other
Tencent Hunyuan Video is a text-to-video generation model, compatible with Hugging Face inference endpoints.
Text-to-Video English
jbilcke-hf
21
1
Mochi
Apache-2.0
A GGUF quantized version of Mochi, a text-to-video generation model that generates video content from text descriptions.
Text-to-Video English
calcuis
140
8
Hunyuan Gguf
Other
Tencent Hunyuan Community Edition's text-to-video model, capable of generating high-quality video content from text prompts.
Text-to-Video English
calcuis
1,871
61
Nova D48w1024 Osp480
Apache-2.0
A non-quantized autoregressive text-to-video model developed by the Beijing Academy of Artificial Intelligence, capable of generating and editing videos based on text prompts.
Text-to-Video
BAAI
314
6
Hunyuanvideo Gguf
Other
GGUF quantized version of Tencent's HunyuanVideo model, designed for use with ComfyUI for text-to-video generation tasks.
Text-to-Video
city96
6,142
162
Fasthunyuan
Other
FastHunyuan is the accelerated version of HunyuanVideo, requiring only 6 diffusion sampling steps to generate high-quality videos, achieving approximately an 8x speed improvement compared to the original version.
Text-to-Video
FastVideo
94
186
Hunyuan
Other
Hunyuan Video is a text-to-video generation model developed by Tencent.
Text-to-Video
FastVideo
106
1
Cogvideox 2B LiFT
MIT
CogVideoX-2B-LiFT is a text-to-video generation model fine-tuned from CogVideoX-1.5 using reward-weighted learning methods.
Text-to-Video English
Fudan-FUXI
21
1
Seba Ai
MIT
A video generation model based on CogVideoX-5b, capable of producing high-quality video content from text descriptions.
Text-to-Video English
GlitchXRiot
13
2
Zlikwidcogvideoxlora
Other
LoRA weights trained for THUDM/CogVideoX-2b, focused on the text-to-video generation task.
Text-to-Video
Zlikwid
1
0
Cogvideox 2b
Apache-2.0
CogVideoX is the open-source version of the video generation model from Qingying. The 2B version is an entry-level model that balances compatibility with low operational and development costs.
Text-to-Video English
rttrsabc
22
1
Vchitect 2.0 2B
Apache-2.0
Vchitect-2.0 is a parallel Transformer model for scaling video diffusion models, specializing in text-to-video and image-to-video generation tasks.
Video Processing
Vchitect
50
38
Animatediff Sparsectrl Scribble
AnimateDiff is a method that transforms static Stable Diffusion models into video generation models by inserting motion modules to achieve coherent video generation.
Text-to-Video
guoyww
247
8
Animatediff Sparsectrl Rgb
AnimateDiff is a method that utilizes existing Stable Diffusion text-to-image models to create videos by inserting motion module layers to achieve coherent motion between frames.
Text-to-Video
guoyww
166
8
Latte 1
Apache-2.0
Latte is a Transformer-based latent diffusion model focused on text-to-video generation tasks, supporting pre-trained weights for multiple datasets.
Text-to-Video
maxin-cn
1,027
19
Text To Video Lvd Zs
A generative model combining large language models and video diffusion technology, supporting bounding-box conditional control.
Text-to-Video
longlian
45
3
Animatediff Motion Adapter Sdxl V1 0 Beta
AnimateDiff is a method that allows the use of existing Stable Diffusion text-to-image models to create videos.
Text-to-Video
Warvito
65
3
Animatediff Motion Adapter V1 5 2
AnimateDiff is a method that enables the use of existing Stable Diffusion text-to-image models to create videos.
Text-to-Video
guoyww
1,153
25