🚀 AnimateDiff: Transforming Text to Video with Stable Diffusion
AnimateDiff is a method that lets you generate videos using pre-existing Stable Diffusion text-to-image models. It inserts motion module layers into a frozen text-to-image model and trains them on video clips to extract a motion prior.
These motion modules are placed after the ResNet and Attention blocks in the Stable Diffusion UNet, where their main function is to introduce consistent motion across frames. To make these modules easy to use, we introduce the concepts of a MotionAdapter and a UNetMotionModel, which provide a convenient way to combine the motion modules with existing Stable Diffusion models.
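As a rough illustration of how these pieces fit together, the minimal sketch below wraps the spatial UNet of an existing Stable Diffusion checkpoint with a MotionAdapter to obtain a UNetMotionModel; the checkpoint names are only examples, and in practice the AnimateDiffPipeline shown later does this wiring for you.

from diffusers import MotionAdapter, UNet2DConditionModel, UNetMotionModel

# Load the spatial UNet from an existing Stable Diffusion checkpoint (example checkpoint)
unet2d = UNet2DConditionModel.from_pretrained(
    "runwayml/stable-diffusion-v1-5", subfolder="unet"
)
# Load the pretrained motion modules
adapter = MotionAdapter.from_pretrained("guoyww/animatediff-motion-adapter-v1-5-2")
# Insert the motion modules into the frozen text-to-image UNet
unet_motion = UNetMotionModel.from_unet2d(unet2d, motion_adapter=adapter)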
✨ Features
- Leverage Existing Models: Utilize pre-trained Stable Diffusion text-to-image models to create videos.
- Motion Modules: Insert motion modules into the UNet of Stable Diffusion to introduce coherent motion between frames.
- Convenient Integration: Use MotionAdapter and UNetMotionModel to easily combine motion modules with existing models.
💻 Usage Examples
Basic Usage
import torch
from diffusers import MotionAdapter, AnimateDiffPipeline, DDIMScheduler
from diffusers.utils import export_to_gif

# Load the motion adapter
adapter = MotionAdapter.from_pretrained(
    "guoyww/animatediff-motion-adapter-v1-5-2", torch_dtype=torch.float16
)
# Load a fine-tuned Stable Diffusion model and combine it with the motion adapter
model_id = "SG161222/Realistic_Vision_V5.1_noVAE"
pipe = AnimateDiffPipeline.from_pretrained(
    model_id, motion_adapter=adapter, torch_dtype=torch.float16
)
scheduler = DDIMScheduler.from_pretrained(
    model_id,
    subfolder="scheduler",
    clip_sample=False,
    timestep_spacing="linspace",
    steps_offset=1,
)
pipe.scheduler = scheduler

# Enable memory savings
pipe.enable_vae_slicing()
pipe.enable_model_cpu_offload()

output = pipe(
    prompt=(
        "masterpiece, bestquality, highlydetailed, ultradetailed, sunset, "
        "orange sky, warm lighting, fishing boats, ocean waves, seagulls, "
        "rippling water, wharf, silhouette, serene atmosphere, dusk, evening glow, "
        "golden hour, coastal landscape, seaside scenery"
    ),
    negative_prompt="bad quality, worse quality",
    num_frames=16,
    guidance_scale=7.5,
    num_inference_steps=25,
    generator=torch.Generator("cpu").manual_seed(42),
)
frames = output.frames[0]
export_to_gif(frames, "animation.gif")
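If you prefer an MP4 over a GIF, diffusers also provides an export_to_video helper. The snippet below is a small follow-up that reuses the frames from the example above and assumes a video backend (such as imageio or OpenCV) is installed.

from diffusers.utils import export_to_video

# Save the same frames as an MP4 instead of a GIF
export_to_video(frames, "animation.mp4", fps=8)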
📚 Documentation
The following table shows an example output of the AnimateDiff model:
| Prompt | Output |
| --- | --- |
| masterpiece, bestquality, sunset. | (example output GIF) |
💡 Usage Tip
AnimateDiff tends to work better with fine-tuned Stable Diffusion models. If you plan on using a scheduler that can clip samples, make sure to disable it by setting clip_sample=False in the scheduler, as sample clipping can have an adverse effect on the generated frames.
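If the pipeline has already been loaded with a different scheduler, one way to ensure sample clipping is off is to rebuild the scheduler from the pipeline's existing config. A minimal sketch, assuming the pipe object from the example above:

from diffusers import DDIMScheduler

# Rebuild the scheduler from the current config, explicitly disabling sample clipping
pipe.scheduler = DDIMScheduler.from_config(
    pipe.scheduler.config,
    clip_sample=False,
    timestep_spacing="linspace",
    steps_offset=1,
)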