W

Wav2vec2 Large Xlsr 53 Spanish

Developed by facebook
A large-scale cross-lingual speech recognition model based on the Wav2Vec2 architecture, specifically optimized for Spanish, released by Facebook
Downloads 66.63k
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model based on the Wav2Vec2 architecture, trained on the XLSR-53 dataset and specifically optimized for Spanish speech recognition tasks.

Model Features

Cross-lingual Pretraining
Trained on the XLSR-53 dataset with cross-lingual transfer learning capabilities
High Accuracy
Achieves a word error rate (WER) of 17.6% on the Common Voice Spanish test set
End-to-End Speech Recognition
Generates text output directly from raw audio input without complex feature engineering

Model Capabilities

Spanish speech-to-text
Continuous speech recognition
Audio feature extraction

Use Cases

Speech Transcription
Voice Memo Transcription
Automatically converts Spanish voice memos into text
Accuracy approximately 82.4%
Customer Service Call Logging
Automatically records and transcribes Spanish customer service calls
Assistive Technology
Voice-Controlled Interface
Provides voice control functionality for Spanish-speaking users
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase