D

Dc Ae F32c32 Sana 1.1

Developed by mit-han-lab
DC-AE is a novel autoencoder architecture designed to accelerate high-resolution diffusion models, addressing reconstruction accuracy issues under high compression ratios
Downloads 18.17k
Release Time : 1/24/2025

Model Overview

This model significantly improves the spatial compression ratio of autoencoders while maintaining reconstruction quality through residual autoencoding and decoupled high-resolution adaptation techniques, greatly accelerating the training and inference processes of diffusion models

Model Features

High compression ratio
Supports spatial compression ratios up to 128×, far exceeding the 8× compression ratio of traditional autoencoders
Residual autoencoding
Learns residuals through spatial-channel transformation features, effectively alleviating optimization challenges under high compression ratios
Decoupled training strategy
Adopts a three-stage decoupled training strategy to mitigate the generalization penalty of high-compression autoencoders
Efficient acceleration
Achieves 19.1× inference acceleration and 17.9× training acceleration on the ImageNet 512×512 dataset

Model Capabilities

High-resolution image compression
Latent space feature extraction
Image reconstruction
Accelerating diffusion model training
Accelerating diffusion model inference

Use Cases

Computer vision
High-resolution image generation
Used to accelerate the training and inference processes of high-resolution diffusion models
Significantly improves speed while maintaining generation quality
Image compression and reconstruction
Achieves high-quality image reconstruction under high compression ratios
Maintains good reconstruction quality even at 128× compression ratio
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase