Wav2vec2 Large English

Developed by jonatasgrosman
An automatic speech recognition model for English, fine-tuned from facebook/wav2vec2-large on the Common Voice 6.1 dataset
Downloads 355
Release Time: 3/2/2022

Model Overview

A large wav2vec2 model optimized for English speech recognition. It expects voice input sampled at 16 kHz.
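
Because the model expects 16 kHz input, it can help to check a recording's sampling rate before transcription. The sketch below is a minimal illustration using only Python's standard library. The helper names (wav_sample_rate, needs_resampling, resample_linear) are hypothetical and are not part of the model or its tooling, and the linear-interpolation resampler applies no anti-aliasing filter, so it is meant to show the idea rather than to replace a dedicated audio library.

```python
import wave

TARGET_RATE = 16_000  # sampling rate the model card says the model expects


def wav_sample_rate(path):
    """Return the sampling rate (in Hz) stored in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path, target_rate=TARGET_RATE):
    """True when the WAV file is not already at the target sampling rate."""
    return wav_sample_rate(path) != target_rate


def resample_linear(samples, source_rate, target_rate=TARGET_RATE):
    """Naive linear-interpolation resampler for a mono sequence of samples.

    Assumption: this is an illustrative stand-in. It does no low-pass
    filtering, so downsampling with it can introduce aliasing.
    """
    if source_rate == target_rate or not samples:
        return list(samples)

    target_length = max(1, round(len(samples) * target_rate / source_rate))
    step = source_rate / target_rate  # source samples advanced per output sample

    resampled = []
    for i in range(target_length):
        position = i * step
        left = int(position)
        if left >= len(samples) - 1:
            # Past the last interval: hold the final sample.
            resampled.append(float(samples[-1]))
            continue
        fraction = position - left
        resampled.append(samples[left] * (1 - fraction) + samples[left + 1] * fraction)
    return resampled
```

For example, needs_resampling("meeting.wav") (a hypothetical file name) would report whether a recording has to be converted before it is passed to the model.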

Model Features

High-performance English recognition
Achieves a word error rate (WER) of 21.53% and a character error rate (CER) of 9.66% on the Common Voice English test set
Based on large pre-trained model
Fine-tuned from the facebook/wav2vec2-large model, which provides strong speech feature extraction
16kHz sampling rate support
Expects voice input sampled at 16 kHz
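
To make the WER and CER figures above concrete, here is a minimal sketch of how the two metrics are commonly computed: the edit distance between a reference transcript and the model output, divided by the reference length in words (WER) or characters (CER). The function names are hypothetical, and the choices made here (whitespace tokenization, spaces counted as characters, no text normalization) are assumptions; the evaluation behind the reported numbers may have used different conventions.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance between two sequences (lists of words or strings)."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_item in enumerate(reference, start=1):
        current = [i]
        for j, hyp_item in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_item != hyp_item)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    reference_words = reference.split()
    if not reference_words:
        raise ValueError("reference transcript must contain at least one word")
    return edit_distance(reference_words, hypothesis.split()) / len(reference_words)


def character_error_rate(reference, hypothesis):
    """Character-level edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

With the reference "turn on the kitchen light" and the output "turn on the kitchen lights", one of five words is wrong (WER 1/5) while only one character was inserted into a 25-character reference (CER 1/25), which is why CER is typically much lower than WER.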

Model Capabilities

English speech recognition
Audio to text
Automatic speech transcription

Use Cases

Speech transcription
Automatic meeting minutes transcription
Automatically convert English meeting recordings into text transcripts
Approximately 78% word-level accuracy (estimated as 100% minus the 21.53% WER on the Common Voice English test set)
Podcast content transcription
Automatically convert English podcast episodes into text content
Voice assistants
English voice command recognition
Recognizes spoken English commands for smart devices
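
For the voice command use case, a transcript with occasional word errors can still be mapped to a fixed command set by fuzzy matching. The sketch below is one possible approach, not something the model provides: it uses Python's standard difflib module, and the command list, the function names, and the 0.8 similarity cutoff are all illustrative assumptions.

```python
import difflib
import re

# Hypothetical command set for a smart-home device.
KNOWN_COMMANDS = [
    "turn on the kitchen light",
    "turn off the kitchen light",
    "play music",
]


def normalize(text):
    """Lowercase the text, drop punctuation, and collapse repeated spaces."""
    lowered = text.lower()
    letters_only = re.sub(r"[^a-z' ]+", " ", lowered)
    return " ".join(letters_only.split())


def match_command(transcript, commands=KNOWN_COMMANDS, cutoff=0.8):
    """Return the closest known command, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(
        normalize(transcript), commands, n=1, cutoff=cutoff
    )
    return matches[0] if matches else None
```

A transcript such as "Turn on the kitchen lights!" would resolve to the first command, while unrelated speech returns None. A higher cutoff reduces false triggers at the cost of rejecting more imperfect transcripts.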