W

Wav2vec2 Large Xlsr 53 Hsb

Developed by anuragshas
Upper Sorbian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz audio input
Downloads 23
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Upper Sorbian, based on Wav2Vec2 architecture and fine-tuned using the Common Voice dataset.

Model Features

Multilingual pretraining foundation
Fine-tuned from XLSR-53 multilingual pretrained model with cross-lingual transfer learning capability
Low-resource language support
Specifically optimized for low-resource languages like Upper Sorbian, suitable for minority language speech recognition scenarios
End-to-end recognition
Direct speech-to-text conversion without requiring a language model

Model Capabilities

Speech recognition
Audio-to-text conversion
Upper Sorbian speech processing

Use Cases

Speech transcription
Upper Sorbian speech transcription
Convert Upper Sorbian speech content into text
Word Error Rate (WER) 65.05%
Language preservation
Minority language digitization
Assist in preserving and digitizing minority languages like Upper Sorbian
Featured Recommended AI Models
ยฉ 2025AIbase