W

Wav2vec2 Xls R Sl A1

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model fine-tuned on the Slovenian language (Common Voice 8.0) dataset based on facebook/wav2vec2-xls-r-300m.
Downloads 25
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Slovenian speech-to-text tasks and performs excellently on the Common Voice 8.0 dataset.

Model Features

High-precision Slovenian recognition
Achieves a word error rate (WER) of 20.63% and a character error rate (CER) of 5.16% on the Common Voice 8.0 test set.
Based on a powerful foundation model
Fine-tuned on the facebook/wav2vec2-xls-r-300m model, inheriting its excellent speech feature extraction capabilities.
Multi-dataset support
Evaluated on multiple datasets including Common Voice and Robust Speech Events.

Model Capabilities

Slovenian speech recognition
Long audio processing (supports chunking)
Conversational speech-to-text

Use Cases

Speech transcription
Slovenian speech-to-text
Convert Slovenian speech content into text
WER of 20.63% on the Common Voice test set
Voice assistants
Slovenian voice command recognition
Used for front-end speech recognition in Slovenian voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase