Q

Qwen2.5 VL 72B Instruct FP8 Dynamic

Developed by RedHatAI
The FP8 quantized version of Qwen2.5-VL-72B-Instruct, supporting vision-text input and text output, suitable for multimodal tasks.
Downloads 1,837
Release Time : 2/6/2025

Model Overview

This model is a quantized version based on Qwen2.5-VL-72B-Instruct, optimized with FP8 weight and activation quantization, suitable for vLLM inference.

Model Features

FP8 Quantization
Both weight and activation quantization use FP8 data type to improve inference efficiency.
Multimodal Support
Supports vision and text input, capable of understanding and generating text related to images.
Efficient Inference
The optimized model enables efficient deployment on the vLLM backend, improving inference speed.

Model Capabilities

Visual Question Answering
Image Caption Generation
Multimodal Reasoning
Document Understanding
Chart Analysis

Use Cases

Education
Teaching Assistance
Analyze charts and images in textbooks to generate explanatory text.
Achieved a score of 66.88 in the MMMU evaluation
Business Analysis
Document Understanding
Automatically parse charts and data in business documents.
Achieved a score of 94.64 ANLS in the DocVQA evaluation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase