Latte 1
Latte is a Transformer-based latent diffusion model focused on text-to-video generation tasks, supporting pre-trained weights for multiple datasets.
Downloads 1,027
Release Time : 6/3/2024
Model Overview
Latte is a latent diffusion model based on the Transformer architecture, primarily designed for text-to-video generation tasks. It supports generating high-quality video content from text input and provides pre-trained weights for various datasets.
Model Features
Text-to-Video Generation
Supports generating high-quality video content from text descriptions
Multi-Dataset Support
Provides pre-trained weights for multiple datasets including FaceForensics, SkyTimelapse, UCF101, and Taichi-HD
Transformer Architecture
Utilizes a Transformer-based latent diffusion model architecture
Text-to-Image Capability
The latest version Latte-1 also supports text-to-image generation
Model Capabilities
Text-to-Video Generation
Text-to-Image Generation
Use Cases
Video Creation
Creative Video Generation
Automatically generates creative video content based on text descriptions
Can produce high-quality video clips
Education
Educational Video Generation
Automatically generates demonstration videos based on teaching content
Featured Recommended AI Models