X

Xls R 300m It Cv8

Developed by masapasa
This model is a speech recognition model fine-tuned on the Common Voice Swedish dataset based on facebook/wav2vec2-xls-r-300m, achieving a word error rate (WER) of 1.0286 on the evaluation set.
Downloads 19
Release Time : 3/2/2022

Model Overview

This is a model for Swedish automatic speech recognition (ASR), based on the Transformer architecture and specifically optimized for Swedish speech data.

Model Features

Low word error rate
Achieved a WER of 1.0286 on the Common Voice Swedish evaluation set, demonstrating excellent performance
Based on large-scale pre-trained model
Fine-tuned from facebook/wav2vec2-xls-r-300m, inheriting powerful speech feature extraction capabilities
Optimized for Swedish
Specifically fine-tuned using Swedish datasets for better recognition performance in Swedish

Model Capabilities

Swedish speech recognition
Speech-to-text
Robust speech event detection

Use Cases

Speech transcription
Swedish speech transcription
Convert Swedish speech content into text
Word error rate 1.0286
Voice assistants
Swedish voice interaction
Used for developing Swedish voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase