W

Wav2vec2 Large Xls R 300m Hsb V2

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model fine-tuned on the Upper Sorbian (HSB) dataset based on Facebook's wav2vec2-xls-r-300m model.
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Upper Sorbian speech recognition tasks, fine-tuned on the Common Voice 8 dataset, capable of converting Upper Sorbian speech into text.

Model Features

Dedicated to Upper Sorbian
A speech recognition model specifically optimized for Upper Sorbian
Based on large-scale pre-trained model
Fine-tuned on Facebook's wav2vec2-xls-r-300m model with powerful speech feature extraction capabilities
Relatively high recognition accuracy
Achieves 46.5% word error rate (WER) and 11.4% character error rate (CER) on the Common Voice 8 test set

Model Capabilities

Upper Sorbian speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech transcription
Upper Sorbian speech transcription
Convert Upper Sorbian speech content into text
46.5% WER on Common Voice 8 test set
Language preservation
Digitization of minority languages
Helps preserve and digitize minority languages like Upper Sorbian
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase