Wav2vec2 Base Voxpopuli Sv Swedish
A Swedish speech recognition model fine-tuned using NST and Common Voice data, based on Facebook's VoxPopuli-sv base model.
Downloads 38
Release Time : 3/2/2022
Model Overview
This model is a Wav2vec 2.0 model for Swedish automatic speech recognition (ASR), fine-tuned on the NST Swedish ASR database and Common Voice dataset.
Model Features
High-performance Swedish recognition
Achieves 5.62% WER on the NST test set and 19.15% WER on the Common Voice test set.
Multi-dataset training
Fine-tuned using the NST Swedish ASR database and Common Voice dataset.
No language model required
Can be used directly without additional language model support.
Model Capabilities
Swedish speech recognition
16kHz audio processing
Use Cases
Speech-to-text
Swedish speech transcription
Convert Swedish speech content into text
Achieves 5.62% word error rate on professional datasets
Voice assistant
Speech recognition component for Swedish voice assistant applications
Featured Recommended AI Models
Š 2025AIbase