W

Wav2vec2 Large Xlsr 53 Es

Developed by pcuenq
A speech recognition model fine-tuned on the Spanish Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53 model, with a test WER of 10.50%.
Downloads 147
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Spanish, capable of converting Spanish speech into text.

Model Features

Low word error rate
Achieves a WER of 10.50% on the Common Voice Spanish test set.
Preserves diacritics
Retains diacritical marks in Spanish to ensure semantic accuracy.
No language model required
Can be used directly without additional language model support.
Multi-stage training
Employs a phased training strategy to progressively optimize model performance.

Model Capabilities

Spanish speech recognition
16kHz audio processing
Batch speech-to-text conversion

Use Cases

Speech transcription
Spanish speech-to-text
Convert Spanish speech content into text format
Approximately 89.5% accuracy (WER 10.5%)
Voice assistants
Spanish voice command recognition
Basic recognition component for Spanish voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase