Q

Qwen2.5 VL 72B Instruct FP8 Dynamic

Developed by parasail-ai
FP8 quantized version of Qwen2.5-VL-72B-Instruct, supporting vision-text input and text output, optimized and released by Neural Magic.
Downloads 78
Release Time : 4/18/2025

Model Overview

This is a quantized model based on Qwen2.5-VL-72B-Instruct, optimized through FP8 weight and activation quantization, suitable for multimodal task processing.

Model Features

FP8 Quantization
Utilizes FP8 weight and activation quantization technology, significantly reducing model size and memory usage
Multimodal Support
Capable of processing both visual and text inputs to perform complex multimodal tasks
Efficient Inference
Optimized for efficient inference under the vLLM framework, supporting single-stream and multi-stream deployment

Model Capabilities

Visual Question Answering
Image Caption Generation
Document Understanding
Multimodal Reasoning
Text Generation

Use Cases

Education
Educational Content Understanding
Analyzing charts and text content in educational materials
Achieved 66.88% accuracy in MMMU evaluation
Business
Document Analysis
Automatically understanding and extracting key information from business documents
Achieved 94.64% accuracy in DocVQA evaluation
General AI Assistant
Multimodal Dialogue
Engaging in natural conversations based on image and text inputs
Maintained 81.94% accuracy in VQAv2 evaluation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase