W

Wav2vec2 Xlsr Romansh Sursilvan

Developed by sammy786
This model is an automatic speech recognition model fine-tuned on the Romansh-Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-1b, achieving a word error rate (WER) of 13.82% on the Common Voice 8 test set.
Downloads 18
Release Time : 3/2/2022

Model Overview

This is an optimized automatic speech recognition model for the Romansh-Sursilvan dialect, fine-tuned based on Facebook's wav2vec2-xls-r-1b architecture.

Model Features

Low Word Error Rate
Achieves a word error rate (WER) of 13.82% and a character error rate (CER) of 3.02% on the Romansh-Sursilvan dialect test set.
Fine-tuned on Large Model
Fine-tuned based on the facebook/wav2vec2-xls-r-1b large model, inheriting its powerful speech feature extraction capabilities.
Multi-dataset Training
Trained by combining multiple datasets including Common Voice Finnish train.tsv, dev.tsv, and other.tsv.

Model Capabilities

Romansh-Sursilvan dialect speech recognition
Robust speech event detection
Conversational speech processing

Use Cases

Speech Transcription
Romansh-Sursilvan Dialect Speech-to-Text
Converts Romansh-Sursilvan dialect speech content into text
Word error rate 13.82%, character error rate 3.02%
Voice Assistants
Romansh-Sursilvan Dialect Voice Assistant
Supports voice interaction systems in the Romansh-Sursilvan dialect
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase