Whisper Large Onnx Int4 Inc

Developed by Intel
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. This repository provides the Whisper large model in ONNX format with INT4 weight-only quantization, produced with Intel® Neural Compressor and Intel® Extension for Transformers.
Downloads: 44
Release Time: 10/8/2023

Model Overview

Whisper was trained on 680,000 hours of labeled data and generalizes to many datasets and domains without fine-tuning. This repository holds the INT4 weight-quantized version of the large model, intended for automatic speech recognition inference.

Model Features

INT4 Quantization
INT4 weight quantization shrinks the model from 8.8 GB to 1.9 GB (roughly 4.6x smaller) with almost no loss in accuracy.
ONNX Format
The model is provided in ONNX format, facilitating deployment and inference across different platforms.
High Performance
The quantized model achieves a word error rate (WER) of 3.05% on the librispeech_asr dataset, nearly identical to the 3.04% of the FP32 version.
No Fine-tuning Required
The model exhibits strong generalization capabilities, adapting to various datasets and domains without fine-tuning.
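
To make the INT4 Quantization feature concrete, the following is a minimal sketch of symmetric, group-wise round-to-nearest quantization to 4-bit integers, written in plain Python. It is an illustration only: the function names, the group size of 32, and the symmetric scheme are assumptions, and the source does not say which algorithm or settings Intel® Neural Compressor applied to this model.

```python
def quantize_int4(weights, group_size=32):
    """Quantize floats to integers in [-8, 7], one scale per group.

    Hypothetical helper for illustration; not the Intel Neural Compressor API.
    """
    q_values, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Map the largest magnitude in the group to 7 so every value fits in 4 bits.
        scale = max_abs / 7 if max_abs > 0 else 1.0
        scales.append(scale)
        q_values.extend(max(-8, min(7, round(w / scale))) for w in group)
    return q_values, scales


def dequantize_int4(q_values, scales, group_size=32):
    """Recover approximate floats from 4-bit integers and per-group scales."""
    return [q * scales[i // group_size] for i, q in enumerate(q_values)]


# Size reduction implied by the figures quoted above (8.8 GB -> 1.9 GB).
fp32_gb, int4_gb = 8.8, 1.9
print(f"size reduction: {fp32_gb / int4_gb:.1f}x")  # prints "size reduction: 4.6x"
```

Each weight is stored as a 4-bit integer plus a shared floating-point scale per group, so the rounding error per weight is at most half of that group's scale. The observed reduction is smaller than the 8x that going from 32 bits to 4 bits would suggest, presumably because the scales and any tensors left unquantized also take space.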

Model Capabilities

Automatic Speech Recognition
Speech Translation

Use Cases

Speech Recognition
Speech-to-Text
Convert speech content into text, suitable for scenarios like meeting minutes and subtitle generation.
Word error rate: 3.05% (librispeech_asr)
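
For readers unfamiliar with the metric, word error rate is the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words. The sketch below computes it for a single utterance in plain Python. The function name is hypothetical, and the source does not describe the text normalization or tooling used to obtain the 3.05% figure, so this shows the standard definition rather than the exact evaluation procedure.

```python
def word_error_rate(reference, hypothesis):
    """Return (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One dropped word out of six reference words.
print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

A corpus-level score is usually obtained by summing the edit distances over all utterances and dividing by the total number of reference words, rather than by averaging per-utterance rates.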