L

Latte 1

Developed by maxin-cn
Latte is a Transformer-based latent diffusion model focused on text-to-video generation tasks, supporting pre-trained weights for multiple datasets.
Downloads 1,027
Release Time : 6/3/2024

Model Overview

Latte is a latent diffusion model based on the Transformer architecture, primarily designed for text-to-video generation tasks. It supports generating high-quality video content from text input and provides pre-trained weights for various datasets.

Model Features

Text-to-Video Generation
Supports generating high-quality video content from text descriptions
Multi-Dataset Support
Provides pre-trained weights for multiple datasets including FaceForensics, SkyTimelapse, UCF101, and Taichi-HD
Transformer Architecture
Utilizes a Transformer-based latent diffusion model architecture
Text-to-Image Capability
The latest version Latte-1 also supports text-to-image generation

Model Capabilities

Text-to-Video Generation
Text-to-Image Generation

Use Cases

Video Creation
Creative Video Generation
Automatically generates creative video content based on text descriptions
Can produce high-quality video clips
Education
Educational Video Generation
Automatically generates demonstration videos based on teaching content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase