Wav2vec2 Large Xlsr 53 842h Luxembourgish 4h
W
Wav2vec2 Large Xlsr 53 842h Luxembourgish 4h
Developed by Lemswasabi
An automatic speech recognition model fine-tuned with 842 hours of unlabeled and 4 hours of labeled Luxembourgish speech data
Downloads 16
Release Time : 3/2/2022
Model Overview
This model is a Luxembourgish speech recognition model based on the wav2vec 2.0 large XLSR-53 architecture, pre-trained on 842 hours of unlabeled data and fine-tuned on 4 hours of labeled data.
Model Features
Cross-lingual speech representation
Utilizes the XLSR-53 multilingual pre-trained model as a foundation to effectively handle the low-resource Luxembourgish language
Efficient data utilization
Achieves good recognition performance using only 4 hours of labeled data
Two-stage training
Pre-trained on large-scale unlabeled data first, then fine-tuned on small-scale labeled data
Model Capabilities
Luxembourgish speech recognition
Speech-to-text
Use Cases
Speech transcription
Luxembourgish media content transcription
Automatically transcribes Luxembourgish radio and TV programs into text
Word error rate 18.77%
Voice assistant
Luxembourgish voice interaction
Develops localized voice assistants for the Luxembourg region
Featured Recommended AI Models
Š 2025AIbase