Q

Qwen2.5 VL 7B Instruct Quantized.w8a8

Developed by RedHatAI
Quantized version of Qwen2.5-VL-7B-Instruct, supporting vision-text input and text output, optimized for inference efficiency through INT8 weight quantization
Downloads 1,992
Release Time : 2/7/2025

Model Overview

A quantized model based on Qwen2.5-VL-7B-Instruct, designed for efficient vision-language tasks, suitable for applications requiring combined image understanding and text generation

Model Features

Efficient INT8 Quantization
Utilizes W8A8 quantization scheme, significantly improving inference efficiency while maintaining model performance
Multimodal Support
Capable of processing both visual and textual inputs, enabling joint tasks of image understanding and text generation
vLLM Optimization
Optimized for the vLLM inference engine, supporting efficient deployment and large-scale serving

Model Capabilities

Visual Question Answering
Image Caption Generation
Multimodal Reasoning
Document Understanding
Chart Analysis

Use Cases

Education
Textbook Content Understanding
Helps students understand charts and illustrations in textbooks
Achieves 52.33% accuracy on the MMMU benchmark
Business
Document Analysis
Automatically parses table and chart information in business documents
Achieves 94.09 ANLS score on the DocVQA benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase