Wav2vec2 Large Xlsr 53 Spanish

Developed by jonatasgrosman
A Spanish speech recognition model, fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice 6.1 Spanish dataset
Downloads 46.28k
Release Time: 3/2/2022

Model Overview

An automatic speech recognition (ASR) model for Spanish that converts speech sampled at 16 kHz into text

Model Features

High-performance Spanish recognition
Achieves a word error rate (WER) of 8.82% on the Common Voice Spanish test set
Language model enhancement
With a language model integrated, WER drops from 8.82% to 6.27%
Based on XLSR-53 large model
Fine-tuned from facebook/wav2vec2-large-xlsr-53, a wav2vec 2.0 model pretrained on speech in 53 languages
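
To make the WER figures above concrete, here is a minimal sketch of how word error rate is commonly computed: the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The function names and the example sentences are illustrative assumptions, not the evaluation script used for this model, and real evaluations usually normalize case and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline: float, improved: float) -> float:
    """Fraction by which `improved` lowers `baseline`."""
    return (baseline - improved) / baseline


# Illustrative sentences (not from the test set): one substitution, one deletion.
print(word_error_rate("hola buenos días a todos", "hola buenas días todos"))
# Relative WER reduction from the listed figures, 8.82% to 6.27%.
print(round(relative_reduction(8.82, 6.27), 3))
```

Using the two listed figures, the language model removes roughly 29% of the word errors in relative terms.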

Model Capabilities

Spanish speech recognition
Audio-to-text conversion
Processes audio sampled at 16 kHz
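
Because the model expects 16 kHz input, it can help to check a recording's sample rate before transcription. The sketch below uses only Python's standard `wave` module and assumes uncompressed WAV input; the helper names and `TARGET_SAMPLE_RATE` are hypothetical, chosen for illustration.

```python
import wave

TARGET_SAMPLE_RATE = 16_000  # rate stated in the model description


def wav_sample_rate(path: str) -> int:
    """Return the sample rate (frames per second) of a WAV file."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path: str, target: int = TARGET_SAMPLE_RATE) -> bool:
    """True when the file's sample rate differs from the target rate."""
    return wav_sample_rate(path) != target
```

A file for which `needs_resampling` returns `True` should be resampled to 16 kHz with an audio library of your choice before it is passed to the model.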

Use Cases

Speech transcription
Speech content transcription
Convert Spanish speech content into text format
Highly accurate transcription results
Voice assistants
Spanish voice interaction
Provides speech recognition capabilities for Spanish voice assistants
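
For either use case, a transcription call might look like the sketch below. It assumes the model is published on the Hugging Face Hub under the identifier `jonatasgrosman/wav2vec2-large-xlsr-53-spanish` (inferred from the developer and model names on this page, not stated here), that `transformers` and `torch` are installed, and that `ffmpeg` is available for decoding audio files. `build_transcriber` is a hypothetical helper name.

```python
# Assumed Hub identifier, inferred from the developer and model names.
MODEL_ID = "jonatasgrosman/wav2vec2-large-xlsr-53-spanish"


def build_transcriber(model_id: str = MODEL_ID):
    """Return a function that maps an audio file path to its transcript."""
    # Imported lazily so that defining this helper does not require the
    # heavy dependencies until a transcriber is actually built.
    from transformers import pipeline

    # Downloads the model weights on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    def transcribe(path: str) -> str:
        # When given a file path, the pipeline decodes the audio and
        # resamples it to the rate the model's feature extractor expects.
        return asr(path)["text"]

    return transcribe
```

Usage would be along the lines of `transcribe = build_transcriber()` followed by `transcribe("sample_es.wav")`, where `sample_es.wav` stands in for any local Spanish recording.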