Q

Qwen2.5 VL 3B Instruct Quantized.w8a8

Developed by RedHatAI
Quantized version of Qwen/Qwen2.5-VL-3B-Instruct, supporting visual-text input and text output, with weights quantized to INT8 and activations quantized to INT8.
Downloads 274
Release Time : 2/7/2025

Model Overview

This model is the quantized version of Qwen/Qwen2.5-VL-3B-Instruct, suitable for vision-language tasks and supporting efficient inference deployment.

Model Features

Efficient Quantization
Weights quantized to INT8 and activations quantized to INT8, significantly improving inference efficiency.
Multimodal Support
Supports visual and text inputs, suitable for complex multimodal tasks.
High-Performance Inference
Efficient deployment via vLLM backend, supporting single-stream and multi-stream asynchronous inference.

Model Capabilities

Visual-Text Understanding
Text Generation
Multimodal Reasoning

Use Cases

Visual Question Answering
Image Content Description
Generate descriptive text based on input images.
Achieved 75.55 accuracy on the VQAv2 dataset.
Document Understanding
Document Visual Question Answering
Parse document images and answer related questions.
Achieved 92.32 ANLS score on the DocVQA dataset.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase