W

Wav2vec2 Large Xls R 300m Sl With LM V1

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model fine-tuned on the Slovenian language (Common Voice 8.0) dataset based on the facebook/wav2vec2-xls-r-300m model, with improved recognition performance through language model (LM) integration.
Downloads 25
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Slovenian speech recognition tasks and has achieved good recognition accuracy on the Common Voice 8.0 dataset.

Model Features

Language Model Enhancement
Integration with a language model (LM) significantly improves recognition accuracy, reducing WER from 20.6% to 13.5%
Multi-dataset Validation
Validated on multiple datasets including Common Voice and Robust Speech Events
Efficient Training
Optimized training process using mixed-precision training and linear learning rate schedulers

Model Capabilities

Slovenian speech recognition
Long audio processing (supports chunking)
High-accuracy character recognition (CER 3.8%)

Use Cases

Speech-to-Text
Speech Transcription
Convert Slovenian speech to text
WER 13.5% on the Common Voice test set
Voice Assistants
Voice Command Recognition
Recognize Slovenian voice commands
WER 46.17% on the Robust Speech Events test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase