W

Wav2vec2 Large Xls R 300m German With Lm

Developed by mfleck
A speech recognition model fine-tuned on the Common Voice German dataset based on facebook/wav2vec2-xls-r-300m, integrated with an n-gram language model, achieving a word error rate of 8.8%
Downloads 26
Release Time : 3/10/2022

Model Overview

This model is an optimized automatic speech recognition (ASR) system for German, suitable for converting German speech to text.

Model Features

Language Model Enhancement
Integrated n-gram language model improves recognition accuracy
High Performance
Achieves a word error rate of 8.8% on the Common Voice evaluation set
Large-scale Pretraining
Fine-tuned based on the 300M-parameter XLS-R architecture

Model Capabilities

German Speech Recognition
Long Audio Processing (supports chunk processing)

Use Cases

Speech-to-Text
Meeting Minutes
Convert German meeting recordings into text transcripts
High-accuracy transcribed text
Media Subtitle Generation
Automatically generate subtitles for German video content
Supports 5-second audio chunk processing
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase