W

Wav2vec2 Xlsr 1b Finnish Lm

Developed by Finnish-NLP
A Finnish automatic speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-1b, trained with 259.57 hours of annotated Finnish speech data, supporting Finnish speech-to-text tasks.
Downloads 32
Release Time : 3/28/2022

Model Overview

This is an automatic speech recognition model optimized for Finnish, fine-tuned based on the 1-billion-parameter Wav2Vec2 XLS-R architecture, suitable for short audio transcription. Includes a Finnish KenLM language model to enhance decoding performance.

Model Features

Large-scale pre-training foundation
Based on the XLS-R architecture pre-trained with 436,000 hours of multilingual speech data, featuring powerful acoustic feature extraction capabilities.
Domain-adapted fine-tuning
Fine-tuned with 259 hours of Finnish data, specifically optimized for parliamentary speeches and broadcast speech scenarios.
Language model enhancement
Includes a 5-gram KenLM language model, significantly improving transcription accuracy.
Efficient inference
Supports direct processing of 20-second short audio, with long audio processed via chunking methods.

Model Capabilities

Finnish speech recognition
Short audio transcription
Decoding with language model

Use Cases

Speech transcription
Parliament meeting minutes
Transcribing Finnish parliamentary speeches
Performs excellently on the Aalto Parliament dataset
Broadcast content transcription
Processing Finnish radio program audio
Achieves WER 5.65% on broadcast corpus
Educational applications
Language learning assistance
Helping learners correct Finnish pronunciation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase