N

Nb Wav2vec2 300m Nynorsk

Developed by NbAiLab
A 300M-parameter speech recognition model fine-tuned on the VoxRex feature extractor, optimized for Nynorsk (New Norwegian), achieving a WER of 12.22% on the NPSC test set
Downloads 73.53k
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system optimized for Nynorsk, built on the Wav2Vec2 architecture and fine-tuned on the Norwegian Parliamentary Speech Corpus (NPSC).

Model Features

Language model enhancement
Integration of a 5-gram KenLM language model reduces the word error rate (WER) by 20.5% relatively
Efficient training
Optimized parameter configuration enables model training on standard GPUs within 3-4 days
Multi-model support
Forms a Norwegian ASR solution matrix alongside the team's Bokmรฅl language model

Model Capabilities

Nynorsk speech-to-text conversion
Long audio segment processing (up to 30 seconds)
Low-resource language support

Use Cases

Government services
Automated parliamentary records
Automatic transcription of Norwegian parliamentary meeting recordings into text records
Test set character error rate as low as 4.19%
Education
Dialect preservation
Used for digital preservation of Nynorsk dialect materials
Featured Recommended AI Models
ยฉ 2025AIbase