Wav2vec2 Large Xlsr 53 Spanish
A large-scale cross-lingual speech recognition model based on the Wav2Vec2 architecture, specifically optimized for Spanish, released by Facebook
Downloads 66.63k
Release Time : 3/2/2022
Model Overview
This model is an automatic speech recognition (ASR) model based on the Wav2Vec2 architecture, trained on the XLSR-53 dataset and specifically optimized for Spanish speech recognition tasks.
Model Features
Cross-lingual Pretraining
Trained on the XLSR-53 dataset with cross-lingual transfer learning capabilities
High Accuracy
Achieves a word error rate (WER) of 17.6% on the Common Voice Spanish test set
End-to-End Speech Recognition
Generates text output directly from raw audio input without complex feature engineering
Model Capabilities
Spanish speech-to-text
Continuous speech recognition
Audio feature extraction
Use Cases
Speech Transcription
Voice Memo Transcription
Automatically converts Spanish voice memos into text
Accuracy approximately 82.4%
Customer Service Call Logging
Automatically records and transcribes Spanish customer service calls
Assistive Technology
Voice-Controlled Interface
Provides voice control functionality for Spanish-speaking users
Featured Recommended AI Models