Wav2vec2 Xls R Tf Left Right Shuru
A speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, achieving a word error rate (WER) of 1.2628 on the evaluation set.
Downloads 29
Release Time : 3/2/2022
Model Overview
This is a speech recognition model fine-tuned based on the wav2vec2-xls-r-300m architecture, suitable for speech-to-text tasks.
Model Features
Low Word Error Rate
Achieved a word error rate (WER) of 1.2628 on the evaluation set, demonstrating excellent performance.
Based on wav2vec2-xls-r Architecture
Utilizes facebook's wav2vec2-xls-r-300m as the base model, featuring powerful speech feature extraction capabilities.
Mixed Precision Training
Employs native AMP for mixed precision training, improving training efficiency.
Model Capabilities
Speech Recognition
Speech-to-Text
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Word error rate 1.2628
Voice Notes
Convert voice notes into editable text
Featured Recommended AI Models
Š 2025AIbase