
SANA1.5 4.8B 1024px

Developed by Efficient-Large-Model
SANA-1.5 is an efficient text-to-image generation model based on the Linear Diffusion Transformer architecture, supporting 1024px high-resolution image generation.
Downloads 268
Release Time: 3/16/2025

Model Overview

SANA-1.5 is an efficient text-to-image model with 4.8B parameters. It combines training-time and inference-time scaling techniques and supports image generation at multiple scales and aspect ratios.

Model Features

Efficient Model Scaling
Scales from 1.6B to 4.8B parameters, matching or surpassing the performance of full training while saving 60% of the training cost
Deep Pruning Support
Supports reducing the model to arbitrary sizes through pruning
Inference Scaling Technique
A small model combined with inference-time scaling can outperform a larger model
High-resolution Generation
Supports image generation at multiple scales and aspect ratios, based on a 1024px resolution
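
The listing does not say how the inference scaling technique works. One common form of inference-time scaling is best-of-N sampling: generate several candidates for the same prompt and keep the one a scoring model rates highest. The sketch below illustrates that idea only. The names `best_of_n`, `toy_generate`, and `toy_score` are hypothetical stand-ins and are not part of any SANA API; in practice the generator would be the image model and the scorer a separate judge model.

```python
import random
from typing import Any, Callable, Tuple


def best_of_n(
    prompt: str,
    generate: Callable[[str, int], Any],
    score: Callable[[str, Any], float],
    n: int = 8,
    seed: int = 0,
) -> Tuple[float, Any]:
    """Generate n candidates for a prompt and return (score, candidate) for the best one."""
    if n < 1:
        raise ValueError("n must be at least 1")
    rng = random.Random(seed)
    # Each candidate gets its own sampling seed so the candidates differ.
    candidates = [generate(prompt, rng.randrange(2**31)) for _ in range(n)]
    scored = [(score(prompt, c), c) for c in candidates]
    # Compare on the score only, so candidates never need to be orderable.
    return max(scored, key=lambda pair: pair[0])


# Hypothetical stand-ins so the sketch runs without a real model.
def toy_generate(prompt: str, seed: int) -> dict:
    """Pretend to sample an image; 'quality' is a made-up number derived from the seed."""
    return {"prompt": prompt, "seed": seed, "quality": random.Random(seed).random()}


def toy_score(prompt: str, candidate: dict) -> float:
    """Pretend to judge an image; a real scorer would inspect the pixels and the prompt."""
    return candidate["quality"]


single_score, _ = best_of_n("a red fox in snow", toy_generate, toy_score, n=1)
best_score, best = best_of_n("a red fox in snow", toy_generate, toy_score, n=8)
```

With the same `seed`, the first candidate is identical in both calls, so the best-of-8 score can never be lower than the single-sample score. The cost is eight generations plus eight scoring passes instead of one.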

Model Capabilities

Text-to-Image Generation
High-resolution Image Generation
Multi-scale Image Generation
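
To make "multi-scale aspect ratio" generation concrete, the sketch below picks a width and height for a requested aspect ratio while keeping the pixel count close to a 1024 x 1024 budget. This is an illustration under stated assumptions, not the model's documented behavior: the function name `bucket_size` is hypothetical, and snapping each side to a multiple of 32 is an assumed constraint that the listing does not specify.

```python
import math
from typing import Tuple


def bucket_size(aspect_ratio: float, base: int = 1024, multiple: int = 32) -> Tuple[int, int]:
    """Return (width, height) near a base*base pixel budget for a width/height ratio.

    Assumption: both sides are rounded to the nearest multiple of `multiple`.
    """
    if aspect_ratio <= 0:
        raise ValueError("aspect_ratio must be positive")
    # Solve width * height = base^2 with width / height = aspect_ratio.
    width = math.sqrt(base * base * aspect_ratio)
    height = width / aspect_ratio

    def snap(value: float) -> int:
        # Round to the nearest allowed size, never below one step.
        return max(multiple, int(round(value / multiple)) * multiple)

    return snap(width), snap(height)


square = bucket_size(1.0)          # (1024, 1024)
landscape = bucket_size(16 / 9)    # wider than tall
portrait = bucket_size(9 / 16)     # taller than wide
```

Because of the rounding, the resulting ratio and pixel count only approximate the requested values.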

Use Cases

Artistic Creation
Art Creation Assistance
Generate artworks based on text prompts
Produces images with artistic styles
Educational Tools
Creative Educational Tools
Develop creative tools for education
Helps students visualize learning content
Research
Generative Model Research
Study performance and limitations of generative models
Advances generative model technology
© 2025 AIbase