Nb Wav2vec2 1b Bokmaal

Developed by NbAiLab
Norwegian automatic speech recognition model fine-tuned from Facebook/Meta's XLS-R feature extractor, achieving a 6.33% word error rate (WER) on the NPSC test set
Downloads 23.95k
Release Time: 3/2/2022

Model Overview

Automatic speech recognition model optimized for Norwegian Bokmål, fine-tuned from the 1B-parameter XLS-R model and accepting 16 kHz audio input
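
As a rough illustration of how such a model is typically used, the sketch below transcribes one audio file through the Hugging Face transformers pipeline. The model ID is an assumption inferred from the model name and developer listed above, and transcribe is a hypothetical helper name; the snippet also assumes transformers, a PyTorch backend, and ffmpeg are installed.

```python
# Assumed Hugging Face model ID, inferred from the model name and developer
# on this page; adjust it if the repository is named differently.
MODEL_ID = "NbAiLab/nb-wav2vec2-1b-bokmaal"
TARGET_SAMPLE_RATE = 16_000  # the model expects 16 kHz audio


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe a single audio file (hypothetical helper, not from the model card)."""
    # Imported lazily so this module loads even without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # Given a file path, the pipeline decodes the file with ffmpeg and
    # resamples it to the sampling rate of the model's feature extractor.
    result = asr(audio_path)
    return result["text"]
```

Calling transcribe("meeting.wav") would return the recognized text as a string; the file name is a placeholder, and the first call downloads the model weights.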

Model Features

High-performance recognition
Achieves 6.33% word error rate on the NPSC test set (with language model)
Language model integration
Supports 5-gram KenLM language model enhancement, significantly improving recognition accuracy
Computational efficiency optimization
Full training takes about 3-4 days on a standard GPU; several alternative parameter settings are provided
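
The word error rate quoted above is conventionally the word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The function below is a minimal sketch of that standard definition; the name word_error_rate is illustrative, and the text normalization used for the reported NPSC score is not specified here, so results may differ from an official evaluation script.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One word ("en") is missing from a five-word reference: 1 / 5 = 0.2
print(word_error_rate("det er en fin dag", "det er fin dag"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.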

Model Capabilities

Norwegian speech-to-text
16 kHz audio processing
Long audio segment processing (up to 30 seconds)
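
Recordings longer than the 30-second limit need to be split before transcription. The sketch below computes sample-index boundaries for consecutive segments at 16 kHz. It is a naive fixed-window split given only as an assumption of how one might prepare input, not the method used by the model authors; segment_bounds and the constant names are hypothetical.

```python
SAMPLE_RATE = 16_000       # samples per second expected by the model
MAX_SEGMENT_SECONDS = 30   # segment length limit listed above


def segment_bounds(num_samples, sample_rate=SAMPLE_RATE,
                   max_seconds=MAX_SEGMENT_SECONDS):
    """Return (start, end) sample indices that tile the audio in order."""
    if num_samples < 0 or sample_rate <= 0 or max_seconds <= 0:
        raise ValueError("num_samples must be >= 0; rate and length must be > 0")
    max_len = int(sample_rate * max_seconds)
    # Each window starts where the previous one ended; the final window is
    # clipped to the end of the recording.
    return [(start, min(start + max_len, num_samples))
            for start in range(0, num_samples, max_len)]


# A 70-second recording yields two full 30-second windows and a 10-second rest.
for start, end in segment_bounds(70 * SAMPLE_RATE):
    print(start / SAMPLE_RATE, end / SAMPLE_RATE)
```

Hard cuts like these can split a word across two segments, so a practical setup might cut at pauses or use overlapping windows instead.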

Use Cases

Speech transcription
Parliament meeting records
Automatically transcribe Norwegian parliamentary meeting audio
WER improved from 17.10% to 5.81% relative to the baseline model (a relative reduction of about 66%)
Voice assistants
Norwegian voice command recognition
Provide voice interaction support for Norwegian smart devices