Xls R 300m Es

Developed by polodealvarado
A Spanish speech recognition model fine-tuned from the facebook/wav2vec2-xls-r-300m checkpoint on the Common Voice dataset, achieving a WER of 14.6% on the test set
Downloads: 23
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Spanish, obtained by fine-tuning the pre-trained XLS-R-300M model. It is intended for Spanish speech-to-text tasks.
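A minimal sketch of how such a model is typically loaded with the Hugging Face transformers library. The Hub identifier below is an assumption inferred from the developer and model names on this page, and the helper name transcribe is hypothetical. Decoding an audio file by path also assumes that ffmpeg is installed.

```python
# Assumed Hub ID, inferred from the developer and model names; verify before use.
MODEL_ID = "polodealvarado/xls-r-300m-es"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file and return the recognized text."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # The ASR pipeline bundles the feature extractor, the model and the decoder.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # a dict that holds the transcript under "text"
    return result["text"]
```

Calling transcribe("reunion.wav") on a local recording (the file name is only an example) would download the model on first use and return the transcript as a string.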

Model Features

High-Performance Spanish Recognition
Achieves a WER of 14.6% on the Common Voice 8.0 Spanish test set
5-gram Language Model Support
Decoding with the included 5-gram language model lowers WER from 14.6% to 10.9%
Optimized Training Configuration
Trained for 13 epochs with a linear learning rate schedule and mixed-precision training
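The WER figures above are word-level edit distances divided by the number of reference words. The following sketch shows that computation and the relative improvement that the 10.9% figure represents over 14.6%. The function names are illustrative, and real evaluations usually normalize case and punctuation first, which is omitted here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline: float, improved: float) -> float:
    """Fraction of the baseline error removed by the improved system."""
    return (baseline - improved) / baseline


# One substitution plus one deletion against four reference words.
example_wer = word_error_rate("el gato come pescado", "el pato come")
# Relative WER reduction from adding the language model, about 25%.
lm_gain = relative_reduction(14.6, 10.9)
```

Because insertions count as errors, WER can exceed 100%, so it is not strictly the complement of an accuracy percentage.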

Model Capabilities

Spanish Speech Recognition
Real-time Speech-to-Text
Long Audio Processing
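Long recordings are commonly handled by cutting the audio into overlapping windows, transcribing each window, and merging the results. The sketch below only computes the window boundaries. The chunk and overlap sizes are illustrative assumptions, not settings documented for this model, and chunk_spans is a hypothetical helper.

```python
from typing import List, Tuple

SAMPLE_RATE = 16_000  # wav2vec2 / XLS-R models expect 16 kHz audio


def chunk_spans(total: int, chunk: int, stride: int) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of overlapping windows.

    Each window is at most `chunk` samples long. Consecutive windows
    overlap by 2 * stride samples so that words cut at a window edge
    appear whole in the neighbouring window.
    """
    if chunk <= 2 * stride:
        raise ValueError("chunk must be longer than twice the stride")
    if total <= 0:
        return []
    step = chunk - 2 * stride
    spans = []
    start = 0
    while True:
        end = min(start + chunk, total)
        spans.append((start, end))
        if end >= total:
            break
        start += step
    return spans


# A 25 s recording split into 10 s windows with 2 s of context on each side.
spans = chunk_spans(25 * SAMPLE_RATE, 10 * SAMPLE_RATE, 2 * SAMPLE_RATE)
```

The transformers ASR pipeline offers the same idea through its chunk_length_s and stride_length_s arguments, which avoids writing the merging step by hand.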

Use Cases

Speech Transcription
Spanish Meeting Minutes
Automatically convert Spanish meeting recordings into text transcripts
WER of 14.6% on the Common Voice 8.0 Spanish test set, which corresponds to roughly 85.4% word accuracy
Voice Assistant Development
Used for developing Spanish voice assistants and dialogue systems
Speech Analysis
Speech Content Analysis
Analyze Spanish speech content for sentiment analysis or keyword extraction