
Wav2vec2 Large 100k Voxpopuli Catala

Developed by softcatala
A Catalan speech recognition model fine-tuned from the VoxPopuli large model on the Common Voice and ParlamentParla datasets
Downloads 16
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Catalan, capable of converting Catalan speech into text.
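
A minimal sketch of how such a model might be called through the Hugging Face transformers pipeline API. The model identifier below is an assumption inferred from the developer name and model title on this page, not something the page states, and the audio file name in the usage note is hypothetical.

```python
# ASSUMPTION: the Hub identifier is inferred from the developer name and
# model title; verify it before use.
MODEL_ID = "softcatala/wav2vec2-large-100k-voxpopuli-catala"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcription of one audio file as plain text."""
    # Imported lazily so this module can be loaded without the heavy
    # dependency; the first call downloads the model weights.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # For a file path, the pipeline decodes the audio (this needs ffmpeg)
    # and resamples it to the rate the model expects.
    result = asr(audio_path)
    return result["text"]
```

Calling transcribe("discurs.wav") on a local Catalan recording (a hypothetical file name) would return the recognized text. Building the pipeline once and reusing it is preferable when transcribing many files.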

Model Features

Multi-dataset training
Trained on the combined Common Voice and ParlamentParla datasets to improve generalization
Low word error rate
Achieves a 5.98% word error rate (WER) on its test sets
No language model required
Can be used directly without additional language model support
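
Fine-tuned wav2vec 2.0 models are typically CTC models, so running without a language model usually means greedy decoding: take the most likely token per audio frame, collapse consecutive repeats, and drop the blank token. The sketch below illustrates that idea in plain Python. The toy vocabulary, the blank id, and the "|" word delimiter are illustrative assumptions, not values read from this model.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_token, blank_id=0, word_delimiter="|"):
    """Collapse repeated frame predictions, drop blanks, and join tokens."""
    # Step 1: merge runs of identical consecutive ids into a single id.
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    # Step 2: remove the CTC blank token.
    tokens = [id_to_token[i] for i in collapsed if i != blank_id]
    # Step 3: turn the word delimiter into spaces.
    return "".join(tokens).replace(word_delimiter, " ").strip()


# Toy vocabulary (hypothetical): id 0 is the blank, "|" separates words.
vocab = {0: "<pad>", 1: "|", 2: "b", 3: "o", 4: "n", 5: "d", 6: "i", 7: "a"}
# Per-frame argmax ids, as an acoustic model might emit them.
frames = [2, 2, 0, 3, 4, 4, 1, 1, 5, 0, 6, 6, 7, 0]
decoded = ctc_greedy_decode(frames, vocab)
print(decoded)  # bon dia
```

A blank between two identical ids keeps them as separate characters, which is how CTC represents doubled letters.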

Model Capabilities

Speech recognition
Speech-to-text
Catalan language processing

Use Cases

Speech transcription
Parliament speech transcription
Convert recordings of Catalan parliamentary speeches into text
Performs well on the ParlamentParla dataset
Audiobook transcription
Convert Catalan audiobooks into text
Achieves a 12.02% word error rate on the 'The Legend of Saint George' audiobook test
Voice assistants
Catalan voice command recognition
Speech recognition component for Catalan voice assistant systems
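
The word error rate figures quoted above follow the standard definition: the word-level edit distance (substitutions, deletions, insertions) between the reference transcript and the model output, divided by the number of reference words. The sketch below computes it for a single sentence pair. The function name and the example sentences are illustrative, and it is not the evaluation script used for the reported numbers.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


# One dropped word out of six reference words.
wer = word_error_rate("el drac viu a la cova", "el drac viu la cova")
print(f"{wer:.2%}")  # 16.67%
```

For a whole test set, WER is usually computed by summing the errors over all utterances and dividing by the total number of reference words, rather than averaging per-sentence rates.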