W

Wav2vec2 Large Xlsr 53 Swedish

Developed by KBLab
A Swedish automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 framework, supporting 16kHz sampled audio input
Downloads 30.51k
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model specifically optimized for Swedish, based on the large-scale XLSR-53 architecture and fine-tuned on the Swedish NST dictation corpus and Common Voice dataset.

Model Features

High-performance Swedish recognition
Achieves a 14.3% word error rate and 4.93% character error rate on the Common Voice Swedish test set
Multi-stage training
Optimized through three stages: pre-training, incremental training, and final fine-tuning
No language model required
Can be used directly without additional language model support

Model Capabilities

Swedish speech recognition
Audio-to-text conversion
Speech processing

Use Cases

Speech transcription
Broadcast content transcription
Automatically transcribe Swedish radio programs into text
Voice command recognition
Recognize Swedish voice commands
Speech assistive technology
Accessibility applications
Provide real-time captioning services for the hearing impaired
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase