S

Sana 600M 512px

Developed by Efficient-Large-Model
Sana is an efficient text-to-image framework capable of generating images with resolutions up to 4096×4096, featuring fast synthesis of high-resolution, high-quality images
Downloads 2,853
Release Time : 11/30/2024

Model Overview

A text-to-image model based on a linear diffusion transformer, using Gemma2-2B-IT as the text encoder and DC-AE as the latent feature encoder

Model Features

High-Resolution Image Generation
Supports generating high-quality images with resolutions up to 4096×4096
Efficient Inference
Can run efficiently on laptop GPUs with fast inference speed
Strong Text-Image Alignment
Generated images closely match the input text prompts
Multi-Scale Support
Supports multi-scale height and width image generation based on 512px

Model Capabilities

Text-to-Image Generation
High-Resolution Image Synthesis
Multilingual Support

Use Cases

Artistic Creation
Artwork Generation
Used for image generation in artistic creation and design processes
Generates high-quality artworks
Educational Tools
Creative Educational Tools
Used for image generation in education or creative tools
Assists in teaching and creative expression
Research
Generative Model Research
Used to explore and understand the limitations and biases of generative models
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase