A

Asr Wav2vec2 Commonvoice 14 Es

Developed by speechbrain
This is an end-to-end automatic speech recognition system trained on the CommonVoice Spanish dataset, using the wav2vec 2.0 pre-trained model combined with a CTC decoder.
Downloads 22
Release Time : 8/9/2023

Model Overview

This model is used for Spanish speech recognition, consisting of a tokenizer and an acoustic model, capable of converting Spanish audio into text.

Model Features

End-to-end speech recognition
Provides a complete speech recognition pipeline, from audio input to text output.
Based on wav2vec 2.0 pre-trained model
Uses the facebook/wav2vec2-large-xlsr-53 pre-trained model as the foundation, offering robust acoustic feature extraction capabilities.
CTC decoder
Employs CTC (Connectionist Temporal Classification) as the decoder, suitable for sequence-to-sequence tasks.
No language model required
The system can perform speech recognition without relying on an external language model.

Model Capabilities

Spanish speech recognition
Audio transcription
16kHz mono audio processing

Use Cases

Speech transcription
Spanish speech-to-text
Convert Spanish speech content into text format.
Test word error rate: 13.28%, character error rate: 3.80%
Voice assistants
Spanish voice command recognition
Used for voice command recognition in Spanish voice assistants or smart home devices.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase