Lumina Next SFT Diffusers
Lumina-Next-SFT is a 2-billion-parameter Next-DiT model that uses Gemma-2B as the text encoder and is enhanced through high-quality supervised fine-tuning (SFT) for text-to-image generation.
Downloads 8,442
Release Time: 6/20/2024
Model Overview
Lumina-Next-SFT is a text-to-image diffusion model based on the Next-DiT architecture, utilizing Gemma-2B as the text encoder to generate high-quality images from text descriptions.
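As a rough illustration of the text-to-image workflow described above, the following is a minimal sketch of inference with the Hugging Face diffusers library. It assumes the `LuminaText2ImgPipeline` class available in recent diffusers releases and the Hub repository id `Alpha-VLLM/Lumina-Next-SFT-diffusers`; neither is stated on this page. The image size and step count are illustrative defaults, not recommended settings.

```python
# Assumed Hub repository id; this page does not state where the weights live.
MODEL_ID = "Alpha-VLLM/Lumina-Next-SFT-diffusers"


def load_pipeline(model_id=MODEL_ID, device="cuda"):
    """Load the text-to-image pipeline (requires torch, diffusers, and a GPU)."""
    # Heavy imports are deferred so this module can be imported without them.
    import torch
    from diffusers import LuminaText2ImgPipeline  # assumed pipeline class

    # bfloat16 is chosen here to reduce memory use; it is an assumption,
    # not a requirement taken from this page.
    pipe = LuminaText2ImgPipeline.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    return pipe.to(device)


def generate(pipe, prompt, width=1024, height=1024, num_inference_steps=30):
    """Run one prompt through the pipeline and return the first image."""
    result = pipe(
        prompt=prompt,
        width=width,
        height=height,
        num_inference_steps=num_inference_steps,
    )
    return result.images[0]


# Example (not executed here, since it downloads the model weights):
#   pipe = load_pipeline()
#   image = generate(pipe, "Concept art of a floating city at sunset")
#   image.save("lumina_demo.png")
```

The loader and the generation call are kept separate so that one loaded pipeline can be reused across many prompts.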
Model Features
High-quality supervised fine-tuning
Supervised fine-tuning (SFT) on high-quality data improves the quality of generated images.
Efficient architecture
Uses the Next-DiT backbone for faster image generation with lower memory consumption.
Powerful text understanding
Uses Gemma-2B as the text encoder for strong text comprehension.
High-resolution support
Supports image generation up to 2K resolution.
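When requesting large images, it can help to keep the requested size within the supported range. The helper below is a small sketch under stated assumptions: it reads "2K" as a 2048-pixel cap on the longer side and snaps both dimensions down to a multiple of 64. Both values are illustrative guesses, not constraints documented on this page, and `fit_resolution` is a hypothetical name.

```python
def fit_resolution(width, height, max_side=2048, multiple=64):
    """Scale (width, height) down to fit max_side, then snap to a multiple.

    max_side=2048 and multiple=64 are assumptions made for illustration.
    """
    long_side = max(width, height)
    if long_side > max_side:
        # Integer arithmetic keeps the aspect ratio without float rounding.
        width = width * max_side // long_side
        height = height * max_side // long_side

    def snap(value):
        # Round down to the nearest multiple, but never below one multiple.
        return max(multiple, value // multiple * multiple)

    return snap(width), snap(height)


print(fit_resolution(4096, 2048))  # (2048, 1024)
print(fit_resolution(3000, 1000))  # (2048, 640)
```

Rounding down rather than up keeps the result at or below the cap, at the cost of a slight change in aspect ratio for sizes that are not already multiples of 64.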
Model Capabilities
Text-to-image generation
High-resolution image generation
Complex scene understanding
Use Cases
Creative design
Concept art creation
Generate concept art for games or movies based on text descriptions.
Produces concept artwork with specific styles and details.
Content creation
Social media content generation
Generate accompanying images for social media posts.
Quickly generates images that match the text content.