W

Wav2vec2 Large Voxrex Swedish

Developed by KBLab
A Swedish automatic speech recognition model fine-tuned based on the VoxRex large model, supporting 16kHz sampling rate audio input
Downloads 101.28k
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system optimized for Swedish, based on Facebook's Wav2vec 2.0 architecture, and fine-tuned on Swedish broadcast, NST, and Common Voice datasets.

Model Features

High-performance Swedish recognition
Achieves 2.5% WER on NST+Common Voice test set and 8.49% WER on Common Voice test set
Supports language model enhancement
Using a 4-gram language model reduces WER from 8.49% to 7.37%
Multi-dataset training
Combined training on Swedish broadcast, NST, and Common Voice datasets

Model Capabilities

Swedish speech recognition
16kHz audio processing
Direct use without language model

Use Cases

Speech-to-text
Broadcast content transcription
Automatically convert Swedish broadcast content into text
Excellent performance on broadcast datasets
Voice assistant
Provide speech recognition capability for Swedish voice assistants
Featured Recommended AI Models
ยฉ 2025AIbase