W

Wav2vec2 Base 100h

Developed by facebook
Wav2Vec2 Base is an automatic speech recognition model pre-trained and fine-tuned on 16kHz sampled LibriSpeech audio for 100 hours.
Downloads 4,380
Release Time : 3/2/2022

Model Overview

This model achieves efficient speech recognition by learning powerful representations from speech audio and fine-tuning, particularly suitable for scenarios with limited annotated data.

Model Features

Efficient Speech Representation Learning
Learns powerful speech representations through latent space masking and quantization contrastive tasks.
Low Annotation Data Requirement
Achieves high performance with limited annotated data, surpassing previous state-of-the-art with just 1 hour of labeled data compared to 100-hour subsets.
High Accuracy
Achieves word error rates (WER) of 1.8/3.3 on the LibriSpeech test set.

Model Capabilities

Speech recognition
Audio-to-text conversion
English speech processing

Use Cases

Speech Transcription
Automatic Meeting Minutes Generation
Automatically converts meeting recordings into text transcripts
Word error rate of 6.1% on clean test set
Voice Assistants
Used as the speech recognition module for voice assistants
Word error rate of 13.5% on other test sets
Education
Language Learning Applications
Helps language learners practice pronunciation and listening
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase