W

Wav2vec2 Large Xlsr 53 Spanish Ep5 944h

Developed by carlosdanielhernandezmena
An acoustic model for Spanish automatic speech recognition, fine-tuned for 5 epochs based on facebook/wav2vec2-large-xlsr-53 using approximately 944 hours of Spanish data.
Downloads 111
Release Time : 12/1/2022

Model Overview

This model is specifically designed for Spanish speech recognition, fine-tuned on a large-scale Spanish dataset, suitable for various Spanish speech recognition scenarios.

Model Features

Multi-dataset training
Trained using approximately 944 hours of Spanish data from the CIEMPIESS-UNAM project and other public repositories
Low WER
Excellent performance on multiple test sets, such as a WER of 9.20% on the Mozilla Common Voice 10.0 test set
Dialect coverage
Training data includes various Spanish dialects, such as those from Mexico, Chile, Colombia, Peru, Argentina, and Puerto Rico

Model Capabilities

Spanish speech recognition
Multi-dialect recognition
High-precision transcription

Use Cases

Speech transcription
Broadcast news transcription
Used for transcribing Spanish broadcast news content
WER of 7.48% on the HUB4NE test set
Telephone speech transcription
Used for transcribing telephone conversation content
WER of 39.12% on the CALLHOME test set
Voice assistants
Spanish voice command recognition
Used for command recognition in Spanish voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase