X

Xls R 300 Sv Cv7

Developed by patrickvonplaten
This is an automatic speech recognition model fine-tuned on the Swedish Common Voice 7.0 dataset based on facebook/wav2vec2-xls-r-300m
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is specifically designed for automatic speech recognition tasks in Swedish and performs excellently on the Common Voice 7.0 dataset

Model Features

High-performance Swedish recognition
Achieves a word error rate (WER) of 15.99% on the Common Voice 7.0 test set
Multi-dataset validation
Validated not only on Common Voice but also on robust speech event datasets
Based on XLS-R architecture
Uses facebook's wav2vec2-xls-r-300m as the base model

Model Capabilities

Swedish speech recognition
Long audio processing (supports chunk processing)

Use Cases

Speech-to-text
Swedish speech transcription
Convert Swedish speech content into text
WER 15.99% on Common Voice test set
Speech analysis
Speech event detection
Identify and analyze specific events in speech
WER 24.41% on robust speech event dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase