Steamboat-willie-14b Open-source Text-to-Video Generation Model - Generate Videos in the Golden Age Animation Style for Free

Steamboat Willie 14b

Developed by benjamin-paine

A text-to-video model trained on clips from 'Steamboat Willie', suitable for generating video content in the golden age animation style.

Text-to-Video #Retro animation style #Text-to-video generation #LoRA fine-tuning

Downloads 108

Release Time : 3/11/2025

Model Overview

This model, fine-tuned with LoRA technology, can generate animated videos in the style of 'Steamboat Willie' based on text descriptions, making it particularly suitable for retro animation style creations.

Model Features

Retro animation style

Capable of generating video content with the style of 'Steamboat Willie' and golden age animation.

LoRA fine-tuning

Uses LoRA technology for fine-tuning, maintaining the base model's capabilities while enabling specific style generation.

Text-to-video

Directly generates coherent video content based on text descriptions.

Model Capabilities

Text-to-video generation

Stylized video generation

Retro animation style rendering

Use Cases

Creative content production

Retro animation creation

Generates animated shorts in the style of 'Steamboat Willie' based on text prompts.

Produces video content that matches the golden age animation style.

Concept visualization

Quickly visualizes creative concepts into retro animation style videos.

Rapid prototyping and concept validation.

🚀 Steamboat Willie LoRA

This LoRA model enables text-to-video generation in the Steamboat Willie style of the golden era animation.

🚀 Quick Start

✨ Features

Text-to-Video Generation: Generate videos based on text prompts in the Steamboat Willie style.
LoRA Integration: Compatible with the Wan2.1-T2V series models.
Multiple Hosting Platforms: Available on Hugging Face and CivitAI.

📦 Installation

To use this model with the diffusers library, you need to install the latest version from the GitHub repository:

pip install git+https://github.com/huggingface/diffusers.git

💻 Usage Examples

Basic Usage

import torch
from diffusers.utils import export_to_video
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.schedulers.scheduling_unipc_multistep import UniPCMultistepScheduler

model_id = "Wan-AI/Wan2.1-T2V-14B-Diffusers"
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
pipe.scheduler = UniPCMultistepScheduler.from_config(
  pipe.scheduler.config,
  flow_shift=5.0
)
pipe.to("cuda")
pipe.load_lora_weights("benjamin-paine/steamboat-willie-14b")
pipe.enable_model_cpu_offload() # for low-vram environments

prompt = "steamboat willie style, golden era animation, an anthropomorphic cat character wearing a hat removes it and performs a courteous bow"
negative_prompt = "色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走"
output = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    height=720,
    width=1280,
    num_frames=81,
    guidance_scale=5.0,
    num_inference_steps=32
).frames[0]
export_to_video(output, "output.mp4", fps=16)

📚 Documentation

Model Description

The model is trained on clips from Steamboat Willie, which are split by scene and captioned using Qwen2.5-VL-3B-Instruct. It is also available for Wan2.1-T2V-1.3B and additionally hosted on CivitAI.

Trigger Words

The model was trained with the trigger phrase "steamboat willie style". The best results are achieved when using this trigger phrase combined with "golden era animation".

📄 License

The model is released under the cc0-1.0 license.

📦 Download Model

The weights for this model are available in Safetensors format. You can download them from the Files & versions tab.

📋 Model Information

Property	Details
Model Type	LoRA for text-to-video generation
Training Data	Clips from Steamboat Willie, captioned by Qwen2.5-VL-3B-Instruct
Base Model	Wan-AI/Wan2.1-T2V-14B
License	cc0-1.0

🚀 Widget Examples

You can try out the model using the following widget examples:

Prompt: steamboat willie style, golden era animation, a stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colorful lights. Many pedestrians walk about.
- Negative Prompt: 色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走
- Output: videos/t2v-1.webp
Prompt: steamboat willie style, golden era animation, close-up of a short fluffy monster kneeling beside a melting red candle. the mood is one of wonder and curiosity, as the monster gazes at the flame with wide eyes and open mouth. Its pose and expression convey a sense of innocence and playfulness, as if it is exploring the world around it for the first time. The use of warm colors and dramatic lighting further enhances the cozy atmosphere of the image.
- Negative Prompt: 色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走
- Output: videos/t2v-2.webp

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご