
Whisper Large V3.w4a16

Developed by nm-testing
This is a quantized version of openai/whisper-large-v3 that stores weights in INT4 and keeps activations in FP16 (the "w4a16" scheme), suitable for vLLM inference.
Downloads 20
Release Date: 2/14/2025

Model Overview

This model is a quantized version of Whisper-large-v3, primarily used for speech recognition tasks to convert audio into text.

Model Features

Efficient Quantization
Quantizes weights to INT4 while keeping activations in FP16, significantly reducing model size and memory usage.
vLLM Compatibility
Optimized for vLLM >= 0.5.2, enabling efficient inference.
High Accuracy Retention
Maintains recognition accuracy close to the original model after quantization.
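
To give a rough sense of the memory reduction, the sketch below estimates weight storage for FP16 versus INT4. It is a back-of-the-envelope calculation, not a measurement of this checkpoint. The parameter count (about 1.55 billion, the figure OpenAI reports for the large Whisper models), the group size of 128, and the FP16 per-group scales are all assumptions that this page does not state. It also assumes every weight is quantized, which overstates the savings if some layers are left in FP16.

```python
from typing import Optional


def weight_bytes(num_params: int, bits_per_weight: int,
                 group_size: Optional[int] = None, scale_bits: int = 16) -> float:
    """Estimate storage for model weights in bytes.

    If group_size is given, one scale of scale_bits is added per group
    of weights (group-wise quantization overhead).
    """
    total_bits = num_params * bits_per_weight
    if group_size:
        total_bits += (num_params / group_size) * scale_bits
    return total_bits / 8


# Assumed values, not taken from the model card.
PARAMS = 1_550_000_000   # approximate Whisper large parameter count
GROUP_SIZE = 128         # hypothetical quantization group size

fp16_bytes = weight_bytes(PARAMS, 16)
int4_bytes = weight_bytes(PARAMS, 4, group_size=GROUP_SIZE)

print(f"FP16 weights: {fp16_bytes / 1e9:.2f} GB")
print(f"INT4 weights: {int4_bytes / 1e9:.2f} GB")
print(f"Reduction:    {fp16_bytes / int4_bytes:.1f}x")
```

Under these assumptions the weights shrink from about 3.1 GB to about 0.8 GB. Activations and the KV cache remain in 16-bit precision, so total runtime memory falls by less than this ratio.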

Model Capabilities

Speech Recognition
Audio to Text
English Transcription

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
WER (Word Error Rate) approximately 12.95%
Podcast Transcription
Convert podcast audio content into searchable text
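
The WER figure quoted above is the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words. The following is a minimal sketch of that metric in plain Python. The lowercasing and whitespace tokenization are simplifying assumptions; the page does not say which text normalization or test set produced the 12.95% figure, and the sample sentences are invented for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution and one deletion over 7 reference words.
reference = "the meeting starts at nine tomorrow morning"
hypothesis = "the meeting start at nine tomorrow"
print(f"WER: {word_error_rate(reference, hypothesis):.2%}")
```

A WER of 12.95% therefore corresponds to roughly 13 word-level errors per 100 reference words.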