Wav2vec2 Large Voxrex Swedish 4gram

Developed by viktor-enzell
This is a model for Swedish automatic speech recognition (ASR), combining the VoxRex-C acoustic model with a 4-gram language model based on social media data.
Downloads 5,891
Release date: May 26, 2022

Model Overview

The model improves the recognition accuracy of the VoxRex-C acoustic model by adding a 4-gram language model trained on text from the Swedish Culturomics Gigaword Corpus. It is intended for Swedish speech recognition.
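To make the idea of a 4-gram language model concrete, the following is a toy sketch of how such a model estimates the probability of a word from the three words before it. It is an illustration only, not the method used to build this model's language model: the function names `train_4gram` and `next_word_probability` and the three-sentence corpus are hypothetical, and the sketch uses raw counts, whereas production n-gram toolkits add smoothing and backoff for unseen contexts.

```python
from collections import Counter, defaultdict


def train_4gram(sentences):
    """Count which word follows each three-word context (toy, unsmoothed)."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        # Pad with start markers so the first words also have a 3-word context.
        tokens = ["<s>"] * 3 + sentence.lower().split() + ["</s>"]
        for i in range(3, len(tokens)):
            context = tuple(tokens[i - 3:i])
            counts[context][tokens[i]] += 1
    return counts


def next_word_probability(counts, context, word):
    """Maximum-likelihood estimate of P(word | last three context words)."""
    key = tuple(w.lower() for w in context[-3:])
    seen = counts.get(key)
    if not seen:
        return 0.0  # unseen context; a real model would back off instead
    return seen[word.lower()] / sum(seen.values())


# Hypothetical miniature corpus, for illustration only.
corpus = ["jag vill ha kaffe", "jag vill ha te", "jag vill ha kaffe"]
model = train_4gram(corpus)

# "kaffe" follows "jag vill ha" in two of the three sentences.
print(next_word_probability(model, ["jag", "vill", "ha"], "kaffe"))
```

During decoding, scores of this kind are combined with the acoustic model's output so that word sequences that are plausible in Swedish are preferred over acoustically similar but unlikely ones.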

Model Features

Enhanced language model
Incorporates a 4-gram language model trained on 40 million words of social media text, which improves recognition accuracy over the acoustic model alone.
High performance
Achieves a 6.47% word error rate on the Common Voice 6.1 test set.
Pre-trained acoustic model
Builds on the pre-trained VoxRex-C acoustic model, which handles acoustic feature extraction.
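For readers unfamiliar with the metric behind the 6.47% figure, word error rate (WER) is the number of word substitutions, deletions, and insertions needed to turn the model's transcript into the reference transcript, divided by the number of words in the reference. The sketch below computes it for a single sentence pair with a standard edit-distance recurrence. The function name `word_error_rate` and the example sentences are illustrative and do not come from the model's evaluation code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of three reference words.
print(word_error_rate("jag heter anna", "jag heter hanna"))
```

A test-set WER is normally computed by summing the errors over all utterances and dividing by the total number of reference words, rather than by averaging per-sentence values.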

Model Capabilities

Swedish speech recognition
Audio transcription
16 kHz audio processing
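The capabilities above can be exercised with a short script. The following is a minimal sketch that assumes the model is published on the Hugging Face Hub under the identifier `viktor-enzell/wav2vec2-large-voxrex-swedish-4gram` (inferred from the developer and model name on this page, not confirmed here) and that the `transformers` library, a PyTorch backend, and `ffmpeg` are installed. Using the 4-gram language model during decoding may additionally require the `pyctcdecode` and `kenlm` packages. The helper name `transcribe` is hypothetical.

```python
# Assumed Hub identifier; verify it before use.
MODEL_ID = "viktor-enzell/wav2vec2-large-voxrex-swedish-4gram"

# The model works on 16 kHz audio.
EXPECTED_SAMPLE_RATE = 16_000


def transcribe(audio_path: str) -> str:
    """Transcribe one Swedish audio file and return the recognized text."""
    # Imported here so the module can be loaded without transformers installed.
    from transformers import pipeline

    # When given a file path, the ASR pipeline decodes the file with ffmpeg
    # and resamples it to the sampling rate the model's feature extractor expects.
    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    return asr(audio_path)["text"]
```

Calling `transcribe("recording.wav")`, where `recording.wav` is a placeholder file name, downloads the model on first use and returns the transcript as a string. For many files, it is cheaper to build the pipeline once and reuse it than to recreate it on every call.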

Use Cases

Speech transcription
Social media audio transcription
Converts Swedish speech from social media platforms into text.
Suited to informal spoken language.
Voice assistants
Used as a speech recognition component for Swedish voice assistant applications.
High-accuracy voice command recognition.