P

Pixtral 12b Quantized.w8a8

Developed by RedHatAI
INT8 quantized version based on mgoin/pixtral-12b, supports vision-text multimodal tasks with optimized inference efficiency
Downloads 309
Release Time : 2/8/2025

Model Overview

This is a multimodal model with INT8 weight and activation quantization, supporting visual input and text output, suitable for image understanding and generation tasks

Model Features

Efficient INT8 Quantization
Both weight and activation quantization are INT8, significantly improving inference efficiency
Multimodal Support
Supports visual input and text output, capable of handling joint tasks involving images and text
vLLM Optimization
Optimized for vLLM inference engine, supporting efficient deployment
High Accuracy Retention
Maintains over 97% of the original model's accuracy after quantization

Model Capabilities

Visual Question Answering
Image Content Description
Document Understanding
Chart Analysis
Multimodal Reasoning

Use Cases

Visual Question Answering
Image Content Understanding
Answer natural language questions about image content
Achieves 78.00 accuracy on the VQAv2 validation set
Document Analysis
Document Question Answering
Extract information from scanned documents and answer questions
Achieves 89.35 ANLS score on the DocVQA validation set
Chart Understanding
Chart Data Analysis
Interpret chart content and answer related questions
Achieves 81.60 accuracy on the ChartQA test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase