
Wav2vec2 Large Xls R 300m Romansh Sursilvan

Developed by infinitejoy
Automatic speech recognition model for the Romansh Sursilvan dialect, fine-tuned from facebook/wav2vec2-xls-r-300m on a Romansh Sursilvan dataset
Downloads: 15
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for the Romansh Sursilvan dialect, fine-tuned from the XLS-R-300M base model. It achieves a 19.81% word error rate (WER) on the Common Voice 7 dataset.

Model Features

Low word error rate
Achieved 19.81% WER and 4.15% CER on the Romansh Sursilvan dialect test set
Based on the XLS-R architecture
Uses XLS-R-300M (facebook/wav2vec2-xls-r-300m) as the base model for its pretrained speech representations
Targeted at low-resource languages
Fine-tuned specifically for Romansh Sursilvan, a relatively low-resource dialect
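
The WER and CER figures above are edit-distance ratios: the number of word-level (or character-level) substitutions, insertions, and deletions needed to turn the model output into the reference transcript, divided by the reference length. The sketch below is a minimal pure-Python illustration of that definition, not the evaluation script used for this model; the function names and the placeholder strings are assumptions made for the example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (lists of words or strings)."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference items into nothing costs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Placeholder tokens, not real Romansh text:
# one substitution (b -> x) and one deletion (e) over 5 reference words.
print(wer("a b c d e", "a x c d"))  # 0.4
print(cer("abcd", "abxd"))          # 0.25
```

Under this definition, a 19.81% WER corresponds to roughly one word-level edit for every five reference words.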

Model Capabilities

Speech-to-text
Romansh Sursilvan dialect recognition
Continuous speech recognition

Use Cases

Speech transcription
Romansh speech transcription
Convert speech content in the Romansh Sursilvan dialect to text
19.81% word error rate, 4.15% character error rate
Voice assistants
Romansh voice command recognition
Supports voice assistants and smart devices that operate in Romansh
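
The transcription use case above can be sketched with the Hugging Face transformers `pipeline` API. This is a minimal example under stated assumptions: the Hub model ID below is inferred from the developer and model name on this page and has not been verified here, the audio path is hypothetical, and decoding audio files through the pipeline requires ffmpeg to be installed.

```python
# Assumed Hub ID, inferred from the developer name and model title on this page.
MODEL_ID = "infinitejoy/wav2vec2-large-xls-r-300m-romansh-sursilvan"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the transcript of one audio file as a string.

    The import is kept inside the function so the module can be loaded
    without transformers installed; the model is downloaded on first use.
    """
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict with a "text" key
    return result["text"]
```

A call such as `transcribe("sample_sursilvan.wav")` (a hypothetical file name) returns the recognized text. For repeated calls, building the pipeline once and reusing it avoids reloading the model each time.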