W

Wav2vec2 Large Xlsr 53 842h Luxembourgish 14h

Developed by Lemswasabi
A large wav2vec2.0 model fine-tuned with 842 hours of unlabeled and 14 hours of labeled Luxembourgish speech data, supporting Luxembourgish speech recognition
Downloads 204
Release Time : 5/21/2022

Model Overview

This model is an automatic speech recognition (ASR) model optimized for Luxembourgish, based on Facebook's wav2vec2.0 large XLSR-53 architecture. It was pre-trained on 842 hours of unlabeled data and fine-tuned on 14 hours of labeled data, with an integrated language model.

Model Features

Cross-lingual pretraining
Based on the XLSR-53 multilingual model, leveraging cross-lingual representations to enhance Luxembourgish recognition performance
Large-scale data training
Trained using 842 hours of unlabeled and 14 hours of labeled Luxembourgish data
Integrated language model
The model incorporates a language model (LM) to improve recognition accuracy
Low word error rate
Achieves a WER of 10.71% and a CER of 2.31% on the test set

Model Capabilities

Luxembourgish speech recognition
Audio-to-text conversion
Automatic speech transcription

Use Cases

Media transcription
Broadcast content transcription
Transcribing Luxembourgish broadcast content such as RTL.lu
Voice assistants
Luxembourgish voice interaction
Providing recognition capabilities for Luxembourgish voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase