
Whisper Large V3 Ca 3catparla

Developed by projecte-aina
An automatic speech recognition model optimized for Catalan, fine-tuned from OpenAI's Whisper-large-v3 and developed by the Barcelona Supercomputing Center.
Downloads 122
Release date: 8/5/2024

Model Overview

This model is designed for automatic speech recognition in Catalan. It converts Catalan audio into plain text without punctuation.

Model Features

High-precision Catalan recognition: achieves a WER (Word Error Rate) of 0.96 on the 3CatParla test set.
Multi-dialect support: recognizes different dialect variants of Catalan.
Large-scale training data: fine-tuned on 710 hours of Catalan data.
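
For readers unfamiliar with the metric, WER is the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is a minimal illustration, not the evaluation script used for this model. It assumes that WER is reported as a percentage (this page does not state the unit) and that lowercasing plus whitespace splitting is adequate normalization; the function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage: 100 * word edit distance / reference length."""
    # Assumed normalization: lowercase and split on whitespace only.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return 100.0 * prev[-1] / len(ref)
```

With a four-word reference and one dropped word, for example `word_error_rate("bon dia a tothom", "bon dia tothom")`, there is one deletion over four reference words, so the result is 25.0.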

Model Capabilities

Catalan audio transcription
Automatic speech recognition
Processes audio sampled at 16 kHz
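
A rough sketch of how these capabilities might be used from Python follows. It is not taken from the model's official documentation. The Hub identifier in `MODEL_ID` is an assumption inferred from the title and developer name on this page, and the helper names `wav_sample_rate` and `transcribe` are hypothetical. The sketch assumes the Hugging Face `transformers` library, a backend such as PyTorch, and `ffmpeg` are installed, and that the input is a WAV file. The sample-rate check is a conservative guard based on the 16 kHz figure above.

```python
import wave

# Assumed Hub identifier, inferred from the page title and developer name.
MODEL_ID = "projecte-aina/whisper-large-v3-ca-3catparla"
EXPECTED_RATE = 16_000  # Hz, per the capability list above


def wav_sample_rate(path: str) -> int:
    """Read the sample rate from a WAV file header."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate()


def transcribe(path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe a 16 kHz WAV file and return the recognized text."""
    rate = wav_sample_rate(path)
    if rate != EXPECTED_RATE:
        raise ValueError(f"expected {EXPECTED_RATE} Hz audio, got {rate} Hz")

    # Imported lazily so the sample-rate check works without the heavy dependency.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(path)  # file paths are decoded by the pipeline via ffmpeg
    return result["text"]
```

Calling `transcribe("clip.wav")` downloads the model on first use. Audio at another sample rate would need resampling to 16 kHz first, or the guard can be relaxed.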

Use Cases

Speech transcription
Broadcast content transcription
Automatically transcribes Catalan broadcast programs into text
Achieves a WER of 0.96 on the 3CatParla test set
Dialect speech recognition
Recognizes different regional dialects of Catalan
WER ranges from 7.88 to 12.25 across the dialect test sets