W

Wav2vec2 Xlsr 1b Finnish Lm

Developed by aapot
A Finnish automatic speech recognition (ASR) model fine-tuned based on Facebook's wav2vec2-xls-r-1b model, trained with 259.57 hours of Finnish annotated data
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is a speech-to-text model optimized for Finnish, utilizing the XLS-R architecture with 1 billion parameters, combined with the KenLM language model to improve recognition accuracy

Model Features

Large-scale pretraining foundation
Based on the XLS-R architecture pretrained with 436,000 hours of multilingual speech data
High-accuracy Finnish recognition
Achieves 5.65% WER and 1.2% CER on the Common Voice test set
Language model enhancement
Includes a specially trained Finnish KenLM 5-gram language model
Efficient training
Uses 8-bit Adam optimizer and mixed-precision training techniques

Model Capabilities

Finnish speech-to-text
Short audio transcription (≤20 seconds)
Improved recognition accuracy with language model

Use Cases

Speech transcription
Meeting minutes transcription
Convert Finnish meeting recordings into text transcripts
Suitable for formal occasions such as parliamentary speeches
Voice assistant
Provide ASR support for Finnish voice interaction systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase