# Low GPU Memory Usage
Moondream 2b 2025 04 14 4bit
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version released on April 14, 2025 significantly reduces memory usage while maintaining high accuracy.
Image-to-Text
Safetensors
M
moondream
6,037
38
Flux.1 Lite 8B
Other
Flux.1 Lite is an 8-billion-parameter Transformer model distilled from the FLUX.1-dev model, reducing memory usage by 7GB, increasing speed by 23%, while maintaining the original model's accuracy.
Text-to-Image
F
Freepik
11.17k
59
Featured Recommended AI Models