W

Wav2vec2 Large Xls R 300m Sr V4

Developed by DrishtiSharma
An automatic speech recognition model fine-tuned on Serbian (sr) dataset based on facebook/wav2vec2-xls-r-300m
Downloads 28
Release Time : 3/2/2022

Model Overview

This model is a wav2vec2 model optimized for Serbian automatic speech recognition (ASR) tasks, fine-tuned on the Common Voice 8 dataset, supporting Serbian speech-to-text tasks.

Model Features

Serbian Optimization
Specially fine-tuned for Serbian, performing well on the Common Voice 8 dataset
Based on Large Model
Built on Facebook's wav2vec2-xls-r-300m large model architecture with powerful speech feature extraction capabilities
Multi-scenario Evaluation
Evaluated on multiple datasets including Common Voice and Robust Speech Challenge

Model Capabilities

Serbian Speech Recognition
Speech-to-Text
Long Audio Processing (supports chunk processing)

Use Cases

Speech Transcription
Serbian Speech Transcription
Convert Serbian speech into text
Achieved a WER of 30.33% on the Common Voice 8 test set
Speech Recognition Systems
Voice Assistant
Used for Serbian voice assistant development
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase