W

Wav2vec2 Large Xls R 1b Swedish

Developed by kingabzpro
This model is an automatic speech recognition model fine-tuned on the Common Voice Swedish dataset based on facebook/wav2vec2-xls-r-1b, supporting Swedish speech-to-text tasks.
Downloads 844
Release Time : 3/2/2022

Model Overview

An automatic speech recognition model optimized for Swedish, based on the wav2vec2-xls-r-1b architecture, fine-tuned on the Common Voice 8.0 dataset, supporting high-precision Swedish speech recognition.

Model Features

High-performance Swedish Recognition
Achieves a word error rate (WER) of 14.04% and a character error rate (CER) of 4.86% on the Common Voice Swedish test set.
Fine-tuned on Large Model
Fine-tuned on the 1-billion-parameter wav2vec2-xls-r-1b model, featuring powerful speech feature extraction capabilities.
Supports Language Model Integration
Can be combined with a language model to further improve recognition accuracy, reducing WER by approximately 4% compared to no language model.

Model Capabilities

Swedish speech recognition
Speech-to-text
Long audio processing (supports chunk processing)

Use Cases

Speech Transcription
Swedish Speech Content Transcription
Convert Swedish speech content into text format
Achieves 14.04% WER on the Common Voice test set
Voice Assistants
Swedish Voice Command Recognition
Used for command recognition in Swedish voice assistant systems
Achieves 29.69% WER on the Robust Speech Events dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase