W

Wav2vec2 Large Xlsr Catala

Developed by ccoreilly
Catalan automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53
Downloads 31
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model optimized for Catalan, fine-tuned using the Common Voice and ParlamentParla datasets, supporting 16kHz sampling rate audio input.

Model Features

Multi-dataset fine-tuning
Trained with both Common Voice and ParlamentParla datasets to enhance model adaptability
Low word error rate
Achieves a word error rate (WER) of 6.92% on the test set, demonstrating excellent performance
No language model required
Can be used directly without additional language model support

Model Capabilities

Speech recognition
Catalan speech-to-text
16kHz audio processing

Use Cases

Speech transcription
Parliament speech transcription
Convert Catalan parliamentary speeches into text
Performs well on the ParlamentParla dataset
Audiobook transcription
Convert Catalan audiobook content into text
Achieves a WER of 13.23% on the audiobook 'The Legend of Saint George'
Voice assistants
Catalan voice command recognition
For Catalan voice assistant systems
Featured Recommended AI Models
ยฉ 2025AIbase