Wav2vec2 Large Xls R 300m Sl With LM V2

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model for Slovenian, fine-tuned from facebook/wav2vec2-xls-r-300m on the Slovenian portion of the common_voice_8_0 dataset. It supports language model (LM) enhanced decoding.
Downloads 26
Release Time: 3/2/2022

Model Overview

This model is designed for Slovenian speech recognition. It is evaluated on the Common Voice 8 dataset and can use a language model during decoding to improve recognition accuracy.

Model Features

Language model enhancement
Supports language model (LM) enhanced decoding, which lowers WER on the Common Voice 8 test set from 0.217 to 0.146 (a relative reduction of about 33%)
Multi-dataset validation
Evaluated on both the Common Voice 8 and Robust Speech Event datasets
Efficient training
Trained with mixed-precision training and a linear learning rate scheduler
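
The WER (word error rate) figures quoted above are conventionally the word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The following is a minimal sketch of that calculation and of the relative reduction between two WER values. The function names and the sample sentences are made up for illustration; they are not taken from the model's own evaluation script.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (minus the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(before: float, after: float) -> float:
    """Fraction by which a metric dropped, relative to its starting value."""
    return (before - after) / before


# Illustrative (made-up) transcripts: one of four reference words is missing.
example_wer = word_error_rate("dober dan vsem skupaj", "dober dan vsem")

# Relative improvement implied by the reported 0.217 -> 0.146 figures.
lm_gain = relative_reduction(0.217, 0.146)
```

Here `example_wer` is one deletion out of four reference words, and `lm_gain` shows that the reported LM improvement corresponds to roughly a one-third relative drop in WER.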

Model Capabilities

Slovenian speech recognition
Long audio processing (supports chunk processing)
Language model integration
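
These capabilities can be exercised through the Hugging Face transformers ASR pipeline, which accepts `chunk_length_s` and `stride_length_s` to split long recordings into overlapping windows. The sketch below rests on several assumptions: the Hub identifier is inferred from the model title and developer name on this page, the chunk and stride values are illustrative rather than recommended settings, and LM decoding typically needs the `pyctcdecode` and `kenlm` packages installed alongside `transformers`.

```python
# Assumed Hub identifier, inferred from the page title and developer name.
MODEL_ID = "DrishtiSharma/wav2vec2-large-xls-r-300m-sl-with-LM-v2"


def build_asr_kwargs(chunk_length_s: float = 30.0,
                     stride_length_s: float = 5.0) -> dict:
    """Collect pipeline arguments; chunk and stride values are illustrative."""
    return {
        "task": "automatic-speech-recognition",
        "model": MODEL_ID,
        # Long audio is cut into windows of this many seconds...
        "chunk_length_s": chunk_length_s,
        # ...that overlap by this many seconds on each side.
        "stride_length_s": stride_length_s,
    }


def transcribe(audio_path: str) -> str:
    """Transcribe one audio file; downloads the model on first use."""
    # Imported lazily so this module loads without transformers installed.
    from transformers import pipeline

    asr = pipeline(**build_asr_kwargs())
    return asr(audio_path)["text"]
```

Calling `transcribe("recording.wav")` with a hypothetical local file would return the transcript as a string. If the model repository ships a language model and the decoder dependencies are present, the pipeline is expected to use LM decoding; otherwise it falls back to plain CTC decoding.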

Use Cases

Speech transcription
Speech-to-text
Convert Slovenian speech to text
Achieves a WER of 0.217 without the LM and 0.146 with the LM on the Common Voice 8 test set
Voice assistants
Slovenian voice command recognition
Used for voice assistant or voice control system command recognition
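WER of 46.69 on the Robust Speech Event test set (this figure appears to be on a percentage scale, unlike the fractional Common Voice 8 values)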