Xls R 300m Et

Developed by TalTechNLP
An Estonian automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on approximately 800 hours of diverse speech data.
Downloads 58
Release date: 3/2/2022

Model Overview

This is a general-purpose Estonian ASR model, intended primarily for transcribing broadcast conversations, interviews, and lectures.

Model Features

Diverse training data
Trained on approximately 800 hours of diverse Estonian data, including broadcast speech, spontaneous speech, elderly speech, and other speech types.
Excellent performance
Achieves a word error rate (WER) of 12.5-13.4% and a character error rate (CER) of 2.7-3.0% on test sets such as Common Voice.
Estonian-focused optimization
Fine-tuned specifically for Estonian, with the aim of better recognition accuracy than general multilingual models.
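
The WER and CER figures quoted above are edit-distance metrics: the number of word-level (or character-level) substitutions, insertions, and deletions needed to turn the model output into the reference transcript, divided by the reference length. The following is a minimal sketch of how such scores are commonly computed. It is not the evaluation script used for this model, and the function names and the example sentences are hypothetical.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    # Hypothetical reference/hypothesis pair, not taken from any test set.
    ref = "tere tulemast eesti keelde"
    hyp = "tere tulemas eesti keelde"
    print(f"WER: {wer(ref, hyp):.3f}")
    print(f"CER: {cer(ref, hyp):.3f}")
```

A single misspelled word counts as one full word error but only a small fraction of the characters, which is why CER is typically much lower than WER for the same output. Published evaluations usually also normalize case and punctuation before scoring; that step is omitted here.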

Model Capabilities

Estonian speech recognition
Broadcast speech transcription
Lecture content transcription
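
As a rough illustration of these capabilities, the sketch below transcribes an audio file with the Hugging Face `transformers` ASR pipeline. It rests on several assumptions: that the model is published on the Hub under the identifier `TalTechNLP/xls-r-300m-et` (inferred from the title and developer name, not stated on this page), that `transformers` and a PyTorch backend are installed, and that ffmpeg is available for decoding audio files. The `transcribe` helper and the file name in the usage note are hypothetical.

```python
# Assumed Hub identifier; verify it against the actual model page before use.
MODEL_ID = "TalTechNLP/xls-r-300m-et"


def transcribe(audio_path: str, chunk_length_s: float = 30.0) -> str:
    """Transcribe one audio file and return the recognized text.

    Long recordings such as lectures are processed in chunks of
    `chunk_length_s` seconds so they fit in memory.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    result = asr(audio_path, chunk_length_s=chunk_length_s)
    return result["text"]
```

Calling `transcribe("lecture.wav")` would download the model on first use and return the transcript as a string. In a real application the pipeline would be created once and reused across files rather than rebuilt on every call.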

Use Cases

Media content processing
Broadcast program transcription
Transcribing broadcast dialogues, interviews, and other content into text
Reported WER: 6.1-7.9%
Educational applications
Lecture content recording
Automatically transcribing lectures and speeches into text