Wav2vec2 Xls R 300m Rm Sursilv D11

Developed by DrishtiSharma
This automatic speech recognition model is facebook/wav2vec2-xls-r-300m fine-tuned on Romansh-Sursilvan speech data. It achieves a 24.09% Word Error Rate (WER) on the Common Voice 8 test set.
Downloads 20
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for the Romansh-Sursilvan dialect, fine-tuned from wav2vec2-xls-r-300m and suited to speech-to-text tasks.
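A minimal sketch of how a speech-to-text call might look with the Hugging Face `transformers` pipeline. The Hub identifier `DrishtiSharma/wav2vec2-xls-r-300m-rm-sursilv-d11` is an assumption inferred from the model name and developer listed on this page, and `transcribe` is a hypothetical helper; verify the identifier before relying on it.

```python
# Assumed Hub id, inferred from the page title and developer name; not confirmed here.
MODEL_ID = "DrishtiSharma/wav2vec2-xls-r-300m-rm-sursilv-d11"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file and return the recognized text.

    Hypothetical helper. Requires `transformers` and `torch`, plus ffmpeg
    so the pipeline can decode the file and resample it for the model.
    """
    # Imported lazily so this module can be loaded without the heavy dependencies.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict with a "text" key
    return result["text"]
```

Calling `transcribe("recording.wav")` with a Romansh-Sursilvan recording (the file name is a placeholder) would download the model on first use and return the transcript as a string.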

Model Features

Low-resource language support
Fine-tuned specifically for the low-resource Romansh-Sursilvan dialect
Measured accuracy
Achieves a 24.09% Word Error Rate (WER) and a 4.98% Character Error Rate (CER) on the Common Voice 8 test set
Based on the XLS-R architecture
Uses Facebook's wav2vec2-xls-r-300m, a multilingual pretrained speech model, as the base for speech feature extraction
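For readers unfamiliar with the two metrics quoted above, here is a rough sketch of how WER and CER are commonly computed: the Levenshtein edit distance between reference and hypothesis, divided by the reference length, at word level and character level respectively. This is not the evaluation script behind the reported figures. The function names are illustrative, the sample sentences are English placeholders, and counting spaces as characters in CER is an assumption of this sketch.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitute, insert, delete)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character Error Rate: character-level edits divided by reference length.

    Spaces count as characters here (an assumption; conventions vary).
    """
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


# Placeholder sentences, not taken from the Common Voice test set.
reference = "the cat sat on the mat"
hypothesis = "the cat sit on mat"
print(f"WER: {wer(reference, hypothesis):.2%}")
print(f"CER: {cer(reference, hypothesis):.2%}")
```

Because a single wrong character makes the whole word count as an error, WER is usually well above CER, which is consistent with the 24.09% and 4.98% figures reported here.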

Model Capabilities

Speech recognition
Speech-to-text
Romansh-Sursilvan dialect processing

Use Cases

Speech transcription
Romansh speech transcription
Convert speech content in the Romansh-Sursilvan dialect to text
Reported result: 24.09% WER on the Common Voice 8 test set
Voice assistance technology
Romansh voice assistant
Develop voice-controlled applications for Romansh speakers