W

Wav2vec2 Large Xlsr 53 Swedish

Developed by MehdiHosseiniMoghadam
This is an automatic speech recognition (ASR) model fine-tuned on the Swedish Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Downloads 24
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Swedish speech recognition tasks, supporting the conversion of 16kHz sampled speech to text.

Model Features

Swedish optimization
Fine-tuned specifically for Swedish, improving the accuracy of Swedish speech recognition.
Based on wav2vec2 architecture
Utilizes Facebook's wav2vec2-large-xlsr-53 pre-trained model as the foundation.
16kHz sampling rate support
Supports processing speech input with a 16kHz sampling rate.

Model Capabilities

Swedish speech recognition
Speech-to-text

Use Cases

Speech transcription
Swedish speech transcription
Convert Swedish speech content into text
Achieves a WER of 41.39% on the Common Voice sv-SE test set.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase