W

Whisper Base

Developed by openai
Whisper is a pre-trained automatic speech recognition (ASR) and speech translation model, trained on 680k hours of labeled data with strong generalization capabilities.
Downloads 491.35k
Release Time : 9/26/2022

Model Overview

Whisper is a Transformer-based encoder-decoder model that supports speech recognition and translation tasks in multiple languages, adapting to different datasets and domains without fine-tuning.

Model Features

Large-scale Pre-training
Trained on 680k hours of labeled speech data with strong generalization capabilities
Multilingual Support
Supports speech recognition and translation tasks in 99 languages
Zero-shot Learning
Adapts to different datasets and domains without fine-tuning
Multitasking
Supports both speech recognition and speech translation tasks

Model Capabilities

English Speech Recognition
Multilingual Speech Recognition
Cross-language Speech Translation
Audio Transcription
Speech-to-Text

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe meeting recordings into text records
WER of 5.01 on the LibriSpeech clean test set
Podcast Transcription
Convert podcast content into searchable text
Speech Translation
Real-time Translation
Translate speech in one language to text in another language in real-time
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase