Wav2vec2 Large Xlsr Catala

Developed by softcatala
Catalan speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice and parliamentary speech datasets
Downloads 64.30k
Release Time: 3/2/2022

Model Overview

This model performs Catalan Automatic Speech Recognition (ASR): it converts Catalan speech into text.

Model Features

Multi-dataset training
Combined training on both Common Voice and parliamentary speech datasets, improving model generalization
Low Word Error Rate
Achieves a 6.92% Word Error Rate (WER) on its test sets
No language model required
Can be used directly without additional language model support
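
For readers unfamiliar with the metric, WER is the word-level edit distance (substitutions, deletions and insertions) between a reference transcript and the model output, divided by the number of reference words. The sketch below is a minimal illustration of that definition, not the evaluation script behind the figures on this page; the function name `word_error_rate` and the sample sentences are hypothetical, and it assumes whitespace tokenization with no text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One dropped word out of four reference words.
print(word_error_rate("bon dia a tothom", "bon dia tothom"))  # 0.25
```

Real evaluations usually lowercase the text and strip punctuation before scoring, so numbers from this sketch are only comparable to published figures if the same normalization is applied.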

Model Capabilities

Catalan speech recognition
Speech-to-text
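
A minimal sketch of speech-to-text with this model, assuming it is published on the Hugging Face Hub under the identifier `softcatala/wav2vec2-large-xlsr-catala` (an assumption, since this page does not give the identifier) and that the `transformers` library, a PyTorch backend and `ffmpeg` are installed. The helper name `transcribe` and the file name in the usage comment are hypothetical.

```python
# Assumed Hub identifier; verify it against the publisher's model page.
MODEL_ID = "softcatala/wav2vec2-large-xlsr-catala"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file to text with the ASR pipeline."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # The pipeline loads the model and its processor, then decodes the audio
    # file (via ffmpeg) at the sampling rate the model expects.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    return asr(audio_path)["text"]


# Example usage (downloads the model weights on first call):
# print(transcribe("gravacio.wav"))
```

No separate language model is configured here, which matches the statement above that the model can be used directly. For long recordings, building the pipeline once and reusing it across files avoids reloading the weights on every call.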

Use Cases

Speech transcription
Parliamentary recording transcription
Convert parliamentary meeting recordings into text records
Performs well on parliamentary speech test sets
Audiobook transcription
Convert Catalan audiobooks into text
Achieves 13.23% WER on 'The Legend of Saint George' audiobook
Voice assistants
Catalan voice command recognition
Supports Catalan-language voice assistants and smart devices