X

Xlsr English

Developed by ashesicsis1
An English speech recognition model fine-tuned on the librispeech_asr dataset based on facebook/wav2vec2-xls-r-300m
Downloads 18
Release Time : 5/29/2022

Model Overview

This model is an XLS-R architecture optimized for English speech recognition tasks, achieving a low word error rate on the LibriSpeech dataset

Model Features

Low Word Error Rate
Achieves a word error rate of 0.1451 on the evaluation set, demonstrating excellent performance
Based on XLS-R Architecture
Utilizes facebook's wav2vec2-xls-r-300m pre-trained model as the foundation
Fine-Tuned
Optimized through 30 training epochs with linear learning rate scheduling

Model Capabilities

English Speech Recognition
Audio to Text Conversion
Large-Scale Speech Data Processing

Use Cases

Speech Transcription
Audiobook Transcription
Automatically convert English audiobooks into text
Highly accurate transcription results
Meeting Minutes
Automatically generate text records of English meetings
Assistive Technology
Hearing Assistance
Provide real-time speech-to-text services for the hearing impaired
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase