W

Wav2vec2 Base Voxpopuli Sv Swedish

Developed by KBLab
A Swedish speech recognition model fine-tuned using NST and Common Voice data, based on Facebook's VoxPopuli-sv base model.
Downloads 38
Release Time : 3/2/2022

Model Overview

This model is a Wav2vec 2.0 model for Swedish automatic speech recognition (ASR), fine-tuned on the NST Swedish ASR database and Common Voice dataset.

Model Features

High-performance Swedish recognition
Achieves 5.62% WER on the NST test set and 19.15% WER on the Common Voice test set.
Multi-dataset training
Fine-tuned using the NST Swedish ASR database and Common Voice dataset.
No language model required
Can be used directly without additional language model support.

Model Capabilities

Swedish speech recognition
16kHz audio processing

Use Cases

Speech-to-text
Swedish speech transcription
Convert Swedish speech content into text
Achieves 5.62% word error rate on professional datasets
Voice assistant
Speech recognition component for Swedish voice assistant applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase