W

Wav2vec2 Base 960h 4 Gram

Developed by patrickvonplaten
Based on Facebook's Wav2Vec2-Base-960h model, with an added English 4-gram language model to improve automatic speech recognition (ASR) accuracy.
Downloads 19
Release Time : 4/12/2022

Model Overview

This model is a variant of Wav2Vec2, specifically designed for English automatic speech recognition tasks, with improved recognition accuracy through integration of a 4-gram language model.

Model Features

Integrated 4-gram language model
Uses the official Librispeech ngrams 4-gram.arpa.gz file to improve speech recognition accuracy.
Based on Wav2Vec2 architecture
Utilizes Facebook's Wav2Vec2-Base-960h model as the foundation, featuring robust speech feature extraction capabilities.

Model Capabilities

English speech recognition
High-accuracy speech-to-text

Use Cases

Speech transcription
Audio content transcription
Automatically converts English speech content into text
Achieves WER of 2.59-6.46 on the LibriSpeech test set
Voice assistants
Voice command recognition
Used for command recognition in voice assistant systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase