S

Stable Video Diffusion Img2vid Xt

Developed by thingthatis
A diffusion model that generates short video clips from static images, supporting 25-frame video generation at 576x1024 resolution
Downloads 17
Release Time : 12/8/2023

Model Overview

This model is a latent diffusion model that generates short video clips by using static images as conditional frames. It is fine-tuned based on SVD image-to-video [14 frames], improving temporal consistency and resolution support.

Model Features

High-Resolution Support
Supports video generation at 576x1024 resolution
Long Video Generation
Can generate 25-frame video clips (approximately 4 seconds)
Temporal Consistency Optimization
Fine-tuned the f8 decoder to improve temporal consistency in generated videos

Model Capabilities

Generate videos from static images
High-resolution video generation
Maintain temporal consistency

Use Cases

Art Creation
Concept Art Animation
Convert static concept art into dynamic presentations
Generates dynamic presentation videos of about 4 seconds
Research
Generative Model Research
Research on image-to-video generation techniques
Content Safety Research
Research on safe deployment of models that may generate harmful content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase