Wav2vec2 Large Xls R 300m Hsb V1

Developed by DrishtiSharma
This is an automatic speech recognition model for Upper Sorbian (HSB), fine-tuned from facebook/wav2vec2-xls-r-300m. It achieves a word error rate (WER) of 0.4393 on the Common Voice 8 test set.
Downloads 20
Release Time: 3/2/2022

Model Overview

This model performs automatic speech recognition for Upper Sorbian. It is based on the wav2vec2 architecture and fine-tuned on the Upper Sorbian portion of the Mozilla Common Voice 8 dataset.

Model Features

Low-resource language support
A speech recognition model fine-tuned for Upper Sorbian, a low-resource language
Based on XLS-R architecture
Uses facebook/wav2vec2-xls-r-300m as the base model, which provides cross-lingual speech representations
Fine-tuned on Common Voice
Fine-tuned on the Upper Sorbian data from Mozilla Common Voice 8 to adapt the base model to the language

Model Capabilities

Upper Sorbian speech recognition
Speech-to-text
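
The speech-to-text capability could be exercised with a sketch like the following. It assumes the model is published on the Hugging Face Hub under the id DrishtiSharma/wav2vec2-large-xls-r-300m-hsb-v1 (inferred from the title and developer name on this page, not confirmed by it), that the transformers library and a PyTorch backend are installed, and that ffmpeg is available for decoding audio files. The function name transcribe and the file name in the usage note are hypothetical.

```python
# Assumed Hub id, inferred from the page title and developer name.
MODEL_ID = "DrishtiSharma/wav2vec2-large-xls-r-300m-hsb-v1"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file and return the recognized text.

    The import is done lazily so that defining this function does not
    require transformers to be installed.
    """
    from transformers import pipeline

    # Builds an ASR pipeline; the first call downloads the model weights.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # When given a file path, the pipeline decodes the audio with ffmpeg
    # at the sampling rate the model's feature extractor expects.
    result = asr(audio_path)
    return result["text"]
```

A call such as transcribe("sample_hsb.wav") would return the transcription as a plain string. For many files it would be cheaper to build the pipeline once and reuse it rather than rebuilding it on every call.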

Use Cases

Speech transcription
Upper Sorbian speech transcription
Convert Upper Sorbian speech content into text
Achieves a WER of 0.4393 on the Common Voice 8 test set
Language preservation
Digitization of minority languages
Helps preserve and digitize minority languages like Upper Sorbian
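
For readers unfamiliar with the WER figure quoted above: word error rate is the number of word-level substitutions, deletions, and insertions needed to turn the model's output into the reference transcript, divided by the number of words in the reference. A WER of 0.4393 therefore corresponds to roughly 44 word errors per 100 reference words. The sketch below is a minimal illustration of that definition, not the evaluation script used for this model; the function name and the placeholder words in the usage note are made up for the example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words to nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)
```

For example, word_error_rate("a b c d", "a x c") counts one substitution and one deletion against a four-word reference and returns 0.5. Real evaluations usually normalize case and punctuation before scoring, which this sketch does not do.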