W

Wav2vec2 Large Xlsr 53 842h Luxembourgish 14h With Lm

Developed by Lemswasabi
A Luxembourgish speech recognition model fine-tuned from the wav2vec 2.0 large XLSR-53 checkpoint, trained with 842 hours of unlabeled and 14 hours of labeled data, integrated with a 5-gram language model
Downloads 170
Release Time : 5/24/2022

Model Overview

This model is an automatic speech recognition system for Luxembourgish, trained with large-scale unlabeled data and a small amount of labeled data, combined with a language model to improve recognition accuracy

Model Features

Cross-lingual Pretraining
Fine-tuned based on the XLSR-53 multilingual model, fully leveraging cross-lingual speech representations
Language Model Integration
Uses a 5-gram language model for output rescoring to improve recognition accuracy
Efficient Data Utilization
Combines 842 hours of unlabeled data and 14 hours of labeled data for training

Model Capabilities

Luxembourgish Speech Recognition
Audio to Text
Speech Transcription

Use Cases

Media Transcription
Broadcast Content Transcription
Transcribing Luxembourgish broadcast content such as RTL.lu
Word Error Rate 9.3%-9.5%
Voice Assistants
Luxembourgish Voice Interaction
Providing voice control features for Luxembourgish users
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase