W

Wav2vec2 Xls R 300m Es

Developed by samitizerxu
This model is a fine-tuned Spanish automatic speech recognition model based on facebook/wav2vec2-xls-r-300m on the COMMON_VOICE - ES dataset.
Downloads 23
Release Time : 3/2/2022

Model Overview

A fine-tuned model for Spanish automatic speech recognition, based on the wav2vec2-xls-r-300m architecture, trained on a general speech dataset.

Model Features

Multi-dataset evaluation
Comprehensively evaluated on Common Voice 7 and Robust Speech Event datasets
Medium-sized model
Based on the 300M-parameter wav2vec2-xls-r architecture, balancing performance and efficiency
Spanish optimization
Specifically fine-tuned for Spanish speech recognition tasks

Model Capabilities

Spanish speech recognition
Continuous speech-to-text
Multi-scenario speech processing

Use Cases

Speech transcription
Spanish speech-to-text
Convert Spanish speech content into text
Achieved 37.37% WER on the Common Voice 7 test set
Voice assistant
Spanish voice command recognition
Recognize and understand Spanish voice commands
Achieved 57.28% WER on the Robust Speech Event test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase