W

Wav2vec2 Large Xlsr 53 842h Luxembourgish 4h

Developed by Lemswasabi
An automatic speech recognition model fine-tuned with 842 hours of unlabeled and 4 hours of labeled Luxembourgish speech data
Downloads 16
Release Time : 3/2/2022

Model Overview

This model is a Luxembourgish speech recognition model based on the wav2vec 2.0 large XLSR-53 architecture, pre-trained on 842 hours of unlabeled data and fine-tuned on 4 hours of labeled data.

Model Features

Cross-lingual speech representation
Utilizes the XLSR-53 multilingual pre-trained model as a foundation to effectively handle the low-resource Luxembourgish language
Efficient data utilization
Achieves good recognition performance using only 4 hours of labeled data
Two-stage training
Pre-trained on large-scale unlabeled data first, then fine-tuned on small-scale labeled data

Model Capabilities

Luxembourgish speech recognition
Speech-to-text

Use Cases

Speech transcription
Luxembourgish media content transcription
Automatically transcribes Luxembourgish radio and TV programs into text
Word error rate 18.77%
Voice assistant
Luxembourgish voice interaction
Develops localized voice assistants for the Luxembourg region
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase