
Xlsr Wav2vec English

Developed by harshit345
An English automatic speech recognition (ASR) model, fine-tuned from facebook/wav2vec2-large on the English portion of the Common Voice dataset. It expects audio input sampled at 16 kHz.
Downloads: 27
Release date: 3/2/2022

Model Overview

This is a fine-tuned Wav2Vec2 model for English automatic speech recognition (ASR). It can be used directly, without an additional language model.
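
Wav2Vec2 ASR models are trained with a CTC objective, so running one without a language model usually means greedy decoding: take the highest-scoring token in each audio frame, merge consecutive repeats, and drop the blank token. The sketch below illustrates that step in plain Python. The vocabulary, the blank index, and the `|` word delimiter are toy assumptions for illustration, not this model's actual vocabulary, and `greedy_ctc_decode` and `one_hot` are hypothetical helper names.

```python
from itertools import groupby

# Toy vocabulary (an assumption for illustration; the real model ships its own).
# Index 0 plays the role of the CTC blank token, "|" marks word boundaries.
VOCAB = ["<pad>", "|", "a", "c", "t"]
BLANK_ID = 0
WORD_DELIMITER = "|"


def greedy_ctc_decode(logits):
    """Decode per-frame scores into text without a language model.

    `logits` is a list of frames; each frame is a list with one score per
    vocabulary entry.
    """
    # 1. Pick the highest-scoring token in every frame.
    best_ids = [max(range(len(frame)), key=frame.__getitem__) for frame in logits]
    # 2. Merge runs of the same token into a single token.
    collapsed = [token_id for token_id, _ in groupby(best_ids)]
    # 3. Drop blanks, which only separate repeated characters.
    chars = [VOCAB[i] for i in collapsed if i != BLANK_ID]
    # 4. Turn the word delimiter into spaces.
    return "".join(chars).replace(WORD_DELIMITER, " ").strip()


def one_hot(index, size=len(VOCAB)):
    """Build a fake frame whose best token is `index` (demo helper)."""
    return [1.0 if i == index else 0.0 for i in range(size)]


# Frames: c c <blank> a t t <blank> | a t
frame_ids = [3, 3, 0, 2, 4, 4, 0, 1, 2, 4]
logits = [one_hot(i) for i in frame_ids]
print(greedy_ctc_decode(logits))  # prints "cat at"
```

Because each frame is decided independently, nothing corrects spelling or word choice afterwards. That is the trade-off of skipping a language model: simpler deployment in exchange for some accuracy.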

Model Features

High Accuracy Recognition
Achieves a word error rate (WER) of 21.53% and a character error rate (CER) of 9.66% on the Common Voice English test set
No Language Model Required
Produces transcripts directly from the model output, with no separate language model
16 kHz Sampling Rate
Expects speech input sampled at 16 kHz; audio recorded at other rates should be resampled first
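
Since the model expects 16 kHz input, recordings made at other rates (48 kHz is common for meeting and podcast audio) need to be converted first. The following is a minimal, dependency-free sketch of that conversion using linear interpolation. `resample_linear` is a hypothetical helper, and linear interpolation applies no anti-aliasing filter, so in practice a dedicated resampler from an audio library such as torchaudio or librosa is the better choice.

```python
TARGET_RATE = 16_000  # sampling rate the model expects, in Hz


def resample_linear(samples, source_rate, target_rate=TARGET_RATE):
    """Resample a mono signal by linear interpolation.

    `samples` is a sequence of floats. Illustrative only: no low-pass
    filtering is applied before downsampling.
    """
    if source_rate <= 0 or target_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if source_rate == target_rate or len(samples) < 2:
        return list(samples)

    n_out = int(len(samples) * target_rate / source_rate)
    step = source_rate / target_rate  # input samples per output sample
    last = len(samples) - 1
    resampled = []
    for i in range(n_out):
        pos = i * step                 # fractional position in the input
        left = min(int(pos), last)
        right = min(left + 1, last)
        frac = pos - left
        resampled.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return resampled


# One second of a 48 kHz ramp becomes one second at 16 kHz.
one_second_48k = [float(i) for i in range(48_000)]
audio_16k = resample_linear(one_second_48k, source_rate=48_000)
print(len(audio_16k))  # prints 16000
```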

Model Capabilities

English Speech Recognition
Audio Transcription
Automatic Speech-to-Text
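
To check transcripts from these capabilities against a reference, the two metrics quoted under Model Features can be computed from edit distance: WER counts word-level substitutions, insertions, and deletions divided by the number of reference words, and CER does the same over characters. The sketch below is one common formulation. The function names are hypothetical, and the text normalization behind the reported figures (casing, punctuation, whitespace handling) is not stated in the source, so results from this code are not guaranteed to be directly comparable.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance between two sequences (words or characters)."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_item in enumerate(reference, start=1):
        current = [i]
        for j, hyp_item in enumerate(hypothesis, start=1):
            cost = 0 if ref_item == hyp_item else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # match or substitution
            ))
        previous = current
    return previous[-1]


def word_error_rate(reference, hypothesis):
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def char_error_rate(reference, hypothesis):
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


reference = "the cat sat on the mat"
hypothesis = "the cat sit on mat"

# One substitution (sat -> sit) and one deletion (the): 2 errors / 6 words.
print(f"WER: {word_error_rate(reference, hypothesis):.2%}")
print(f"CER: {char_error_rate(reference, hypothesis):.2%}")
```

A WER of 21.53% therefore means roughly one word-level error for every five reference words on the test set.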

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe meeting recordings into text records
Podcast Transcription
Automatically convert English podcast content into text transcripts
Assistive Technology
Voice Control
Add voice control functionality to applications