Lumina Next SFT Diffusers

Developed by Alpha-VLLM
Lumina-Next-SFT is a 2-billion-parameter Next-DiT model that uses Gemma-2B as the text encoder and is enhanced through high-quality supervised fine-tuning (SFT) for text-to-image generation.
Downloads 8,442
Release Time: 6/20/2024

Model Overview

Lumina-Next-SFT is a text-to-image diffusion model based on the Next-DiT architecture, utilizing Gemma-2B as the text encoder to generate high-quality images from text descriptions.

Model Features

High-quality supervised fine-tuning
High-quality supervised fine-tuning (SFT) improves the quality of the generated images.
Efficient architecture
Uses the Next-DiT backbone for faster image generation with lower memory consumption.
Powerful text understanding
Uses Gemma-2B as the text encoder for stronger comprehension of text prompts.
High-resolution support
Supports image generation up to 2K resolution.
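
Usage sketch
The features above can be tried with the Hugging Face Diffusers library. The following is a minimal sketch, not an official recipe. It assumes the weights are published on the Hub as Alpha-VLLM/Lumina-Next-SFT-diffusers, that the installed Diffusers release provides LuminaText2ImgPipeline, and that a CUDA GPU with bfloat16 support is available. The helper snap_resolution, the 2048-pixel reading of "2K", and the multiple-of-16 rule are illustrative assumptions rather than documented limits.

```python
MODEL_ID = "Alpha-VLLM/Lumina-Next-SFT-diffusers"  # assumed Hub repository id
MAX_SIDE = 2048      # assumption: "up to 2K" read as at most 2048 px per side
SIDE_MULTIPLE = 16   # assumption: 8x VAE downsampling combined with 2x2 patches


def snap_resolution(width: int, height: int) -> tuple[int, int]:
    """Clamp each side to [SIDE_MULTIPLE, MAX_SIDE], then round down to a multiple of SIDE_MULTIPLE."""
    def snap(side: int) -> int:
        side = max(SIDE_MULTIPLE, min(side, MAX_SIDE))
        return side - side % SIDE_MULTIPLE

    return snap(width), snap(height)


def generate(prompt: str, width: int = 1024, height: int = 1024,
             steps: int = 30, guidance_scale: float = 4.0):
    """Load the pipeline and return one PIL image for the prompt."""
    # Heavy imports are deferred so the resolution helper works without a GPU stack.
    import torch
    from diffusers import LuminaText2ImgPipeline

    width, height = snap_resolution(width, height)
    pipe = LuminaText2ImgPipeline.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)
    pipe = pipe.to("cuda")  # assumes a CUDA device is present
    result = pipe(
        prompt=prompt,
        width=width,
        height=height,
        num_inference_steps=steps,
        guidance_scale=guidance_scale,
    )
    return result.images[0]
```

On a suitable machine, generate("concept art of a floating city at dusk").save("lumina_demo.png") would write one image to disk. The first call downloads the model weights, which may take a while.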

Model Capabilities

Text-to-image generation
High-resolution image generation
Complex scene understanding

Use Cases

Creative design
Concept art creation
Generate concept art for games or movies based on text descriptions.
Produces concept artwork with specific styles and details.
Content creation
Social media content generation
Generate accompanying images for social media posts.
Quickly generates visual images that match the text content.