X

Xls R 2b Nl V2 Lm 5gram Os2 Hunspell

Developed by FremyCompany
A CTC model based on XLS-R with a 5-gram language model from Open Subtitles, primarily used for automatic speech recognition in Dutch and Flemish.
Downloads 18
Release Time : 3/2/2022

Model Overview

This model is a version of facebook/wav2vec2-xls-r-2b-22-to-16, fine-tuned mainly on the CGN dataset and the Dutch dataset from Common Voice 8.0, with the addition of a large 5-gram language model.

Model Features

High Accuracy Speech Recognition
Achieved high accuracy with WER 3.93 and CER 1.22 on the evaluation set of Common Voice 8.0.
Multi-language Support
Supports speech recognition for Dutch and its dialects (Belgian Dutch and Netherlands Dutch).
5-gram Language Model
A large 5-gram language model trained on the Open Subtitles Dutch corpus, significantly improving recognition accuracy.
Spelling Correction
Uses hunspell for spelling correction, further enhancing the accuracy of recognition results.

Model Capabilities

Dutch Speech Recognition
Belgian Dutch Speech Recognition
Netherlands Dutch Speech Recognition
High Accuracy Text Transcription

Use Cases

Speech-to-Text
Meeting Minutes
Convert Dutch or Flemish meeting recordings into text transcripts.
High accuracy transcription suitable for subsequent analysis and archiving.
Voice Assistant
Used as the speech recognition module for Dutch voice assistants.
Improves recognition accuracy and user experience for voice assistants.
Education
Language Learning
Helps learners practice Dutch pronunciation and receive instant feedback.
Provides accurate pronunciation assessment and text transcription.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase