AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Image Content Understanding

# Image Content Understanding

Typhoon2 Qwen2vl 7b Vision Instruct
Apache-2.0
Typhoon2-Vision is a Thai-supported visual language model capable of processing image and video inputs, specifically optimized for image-based applications.
Text-to-Image Transformers Supports Multiple Languages
T
scb10x
793
11
Vision 8B MiniCPM 2 5 Uncensored And Detailed 4bit
The int4 quantized version of MiniCPM-Llama3-V 2.5, significantly reducing GPU VRAM usage (approximately 9GB)
Text-to-Image Transformers
V
sdasd112132
330
30
Minicpm Llama3 V 2 5 Int4
The int4 quantized version of MiniCPM-Llama3-V 2.5 significantly reduces GPU VRAM usage to approximately 9GB, suitable for visual question answering tasks.
Text-to-Image Transformers
M
openbmb
17.97k
73
Tinyllava 1.1b V0.1
Apache-2.0
A lightweight visual question answering model based on TinyLlama-1.1B, trained using the BakLlava codebase
Text-to-Image Transformers
T
0xAmey
16
21
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase