W

Wav2vec2 Xls R 1b Ca Lm

Developed by PereLluis13
This is a Catalan speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m, trained on multiple Catalan datasets.
Downloads 3,758
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model for Catalan, fine-tuned on the Common Voice 8.0, tv3_parla, and parlament_parla datasets.

Model Features

Multi-dataset training
Trained on three Catalan datasets—Common Voice 8.0, tv3_parla, and parlament_parla—enhancing model robustness.
Data preprocessing optimization
Removed characters not present in the Catalan alphabet and converted numbers to their textual form, improving recognition accuracy.
High-performance results
Achieved excellent performance on multiple test sets, such as a WER of only 6.07% on the Common Voice 8.0 test set.

Model Capabilities

Catalan speech recognition
High-accuracy transcription
Multi-domain speech processing

Use Cases

Media transcription
TV program subtitle generation
Automatically generate subtitles for Catalan TV programs
Achieved a WER of 11.21% on the tv3_parla test set
Meeting transcription
Parliament meeting transcription
Automatically transcribe Catalan parliamentary meetings
Achieved a WER of 5.14% on the parlament_parla test set
Voice assistants
Catalan voice input
Provide speech recognition capabilities for Catalan voice assistants
Achieved a WER of 6.07% on the Common Voice test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase