Wav2vec2 Large Xlsr 53 842h Luxembourgish 14h
Developed by Lemswasabi
A large wav2vec 2.0 model pre-trained on 842 hours of unlabeled Luxembourgish speech and fine-tuned on 14 hours of labeled Luxembourgish speech, built for Luxembourgish speech recognition
Downloads 204
Release Time: 5/21/2022
Model Overview
This model is an automatic speech recognition (ASR) model optimized for Luxembourgish, based on Facebook's wav2vec 2.0 large XLSR-53 architecture. It was pre-trained on 842 hours of unlabeled Luxembourgish speech and fine-tuned on 14 hours of labeled Luxembourgish speech, and it includes an integrated language model.
Model Features
Cross-lingual pretraining
Based on the XLSR-53 multilingual model, leveraging cross-lingual representations to enhance Luxembourgish recognition performance
Large-scale data training
Pre-trained on 842 hours of unlabeled Luxembourgish speech and fine-tuned on 14 hours of labeled Luxembourgish speech
Integrated language model
The model incorporates a language model (LM) to improve recognition accuracy
Low word error rate
Achieves a WER of 10.71% and a CER of 2.31% on the test set
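For readers unfamiliar with the two metrics, the sketch below shows how word error rate (WER) and character error rate (CER) are commonly computed: the edit distance between the reference transcript and the model output, divided by the reference length. The function names and the example sentence are illustrative assumptions, not taken from the model's evaluation code, and this version counts spaces as characters in the CER, which is only one of several conventions.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance / reference length.

    Assumption: spaces are counted as characters.
    """
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


# Illustrative example: the hypothesis drops one word out of five.
reference = "moien wéi geet et dir"
hypothesis = "moien wéi geet dir"
print(f"WER: {wer(reference, hypothesis):.2f}")  # WER: 0.20
print(f"CER: {cer(reference, hypothesis):.3f}")  # CER: 0.143
```

Under this definition, a WER of 10.71% means roughly one word-level error for every nine to ten reference words.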
Model Capabilities
Luxembourgish speech recognition
Audio-to-text conversion
Automatic speech transcription
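A minimal sketch of how audio-to-text conversion might look with the Hugging Face `transformers` pipeline. The repository name in `MODEL_ID` is an assumption inferred from the model title and developer name and should be checked on the Hub before use. The `transcribe` helper is hypothetical. Decoding audio from a file path typically requires `ffmpeg`, and a checkpoint that bundles a language model may also need `pyctcdecode` and `kenlm` installed.

```python
import sys

# Assumption: Hub repository name inferred from the model title; verify before use.
MODEL_ID = "Lemswasabi/wav2vec2-large-xlsr-53-842h-luxembourgish-14h"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcript of one audio file (hypothetical helper)."""
    # Imported lazily so this module loads even when transformers is not installed.
    from transformers import pipeline

    # The pipeline decodes and resamples the audio file before running the model.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)
    return result["text"]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python transcribe.py recording.wav
    print(transcribe(sys.argv[1]))
```

Building the pipeline once and reusing it across files avoids reloading the model weights for every recording.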
Use Cases
Media transcription
Broadcast content transcription
Transcribing Luxembourgish broadcast content, such as programming from RTL.lu
Voice assistants
Luxembourgish voice interaction
Providing recognition capabilities for Luxembourgish voice assistants