nanoLLaVA-1.5

Developed by qnguyen3
nanoLLaVA-1.5 is a compact yet powerful vision-language model with under 1 billion parameters, designed specifically for edge devices.
Release Time: 6/29/2024

Model Overview

nanoLLaVA-1.5 is an upgrade of nanoLLaVA v1.0. It is an efficient vision-language model suited to image-text-to-text tasks.

Model Features

Compact yet Powerful
Has under 1 billion parameters and is designed for edge devices, yet remains highly capable.
Multimodal Support
Supports multimodal tasks involving vision and language.
Efficient Inference
Optimized to run efficiently even on edge devices.
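
To make the edge-device claim more concrete, the sketch below estimates how much memory the weights alone would occupy at several numeric precisions. It assumes an upper bound of exactly 1 billion parameters (the page only says "under 1 billion"), and the precision list and the names `PARAM_UPPER_BOUND`, `weight_memory_gib`, and `estimate_all` are illustrative choices, not part of the model or its tooling. Activations, the image encoder's intermediate buffers, and any generation cache are not counted, so real usage would be higher.

```python
# Rough weight-memory estimate for a model with at most 1 billion parameters.
# Assumption: 1e9 is an upper bound taken from "under 1 billion parameters";
# the exact parameter count is not stated on this page.
PARAM_UPPER_BOUND = 1_000_000_000

# Bytes per parameter for common numeric formats (illustrative choices).
BYTES_PER_PARAM = {"float32": 4.0, "float16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: int, bytes_per_param: float) -> float:
    """Return the memory needed to hold the weights alone, in GiB."""
    return num_params * bytes_per_param / (1024 ** 3)


def estimate_all(num_params: int = PARAM_UPPER_BOUND) -> dict:
    """Map each format name to its weight footprint in GiB, rounded to 2 places."""
    return {
        name: round(weight_memory_gib(num_params, size), 2)
        for name, size in BYTES_PER_PARAM.items()
    }


if __name__ == "__main__":
    for name, gib in estimate_all().items():
        print(f"{name}: at most {gib} GiB for weights")
```

Under these assumptions the weights stay below roughly 1.86 GiB at 16-bit precision, which is within the memory budget of many phones and single-board computers.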

Model Capabilities

Image caption generation
Visual question answering
Multimodal reasoning

Use Cases

Visual Question Answering
Image content description
Generate detailed textual descriptions based on images.
Education
Scientific question answering
Answer scientific questions based on images.