
Wav2vec2 Large Xlsr German

Developed by maxidl
An automatic speech recognition (ASR) model based on Facebook's wav2vec2-large-xlsr-53 and fine-tuned on the Common Voice German dataset.
Downloads 253
Release Time: 3/2/2022

Model Overview

This model is optimized for German automatic speech recognition: it converts spoken German into text for applications that need speech-to-text conversion.

Model Features

Accurate German recognition
Achieves a word error rate (WER) of 12.77% on the Common Voice German test set.
Based on the XLSR architecture
Uses facebook/wav2vec2-large-xlsr-53 as the base model, which was pretrained on unlabeled speech in 53 languages and supplies the speech feature extraction.
No language model required
Can be used directly for decoding, without an additional language model.
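
Decoding without a language model usually means greedy CTC decoding: take the most likely token for each audio frame, merge consecutive repeats, and drop the blank token. The sketch below illustrates that idea in plain Python. The vocabulary, the blank index, and the frame scores are invented for illustration and are not taken from the model.

```python
from typing import List, Sequence

# Toy vocabulary (hypothetical, not the model's real one); index 0 is the CTC blank.
VOCAB = ["<blank>", "h", "a", "l", "o", " "]
BLANK_ID = 0


def greedy_ctc_decode(frame_scores: Sequence[Sequence[float]]) -> str:
    """Pick the best token per frame, merge repeats, then drop blanks."""
    best_ids: List[int] = [
        max(range(len(row)), key=row.__getitem__) for row in frame_scores
    ]
    chars: List[str] = []
    previous = None
    for token_id in best_ids:
        # Emit a token only when it differs from the previous frame and is not blank.
        if token_id != previous and token_id != BLANK_ID:
            chars.append(VOCAB[token_id])
        previous = token_id
    return "".join(chars)


def one_hot(token_id: int) -> List[float]:
    """Demo helper: a frame that is certain about a single token."""
    return [1.0 if i == token_id else 0.0 for i in range(len(VOCAB))]


# Frames spell: h h a <blank> l l <blank> l o
frames = [one_hot(i) for i in [1, 1, 2, 0, 3, 3, 0, 3, 4]]
print(greedy_ctc_decode(frames))  # prints "hallo"
```

The blank between the two runs of "l" is what lets the decoder keep a genuine double letter while still merging repeated frames of the same sound.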

Model Capabilities

German speech recognition
16 kHz audio processing
Batch speech-to-text conversion
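
The capabilities above could be combined roughly as follows. This is a minimal sketch, not the author's reference code: the Hub id `maxidl/wav2vec2-large-xlsr-german` is inferred from the developer and model name on this page and should be verified, the `batches` and `transcribe` helpers are hypothetical names, and `torch`, `transformers`, and `librosa` are assumed to be installed.

```python
from typing import Iterator, List, Sequence

# Assumed Hugging Face Hub id, inferred from the developer and model name; verify before use.
MODEL_ID = "maxidl/wav2vec2-large-xlsr-german"
TARGET_SAMPLE_RATE = 16_000  # the model works on 16 kHz audio


def batches(items: Sequence[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def transcribe(paths: Sequence[str], batch_size: int = 4) -> List[str]:
    """Transcribe audio files in batches with greedy (argmax) decoding."""
    # Heavy dependencies are imported lazily so batches() stays usable without them.
    import librosa
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
    model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID).eval()

    texts: List[str] = []
    for batch in batches(paths, batch_size):
        # librosa.load resamples to the requested rate and mixes down to mono.
        audio = [librosa.load(path, sr=TARGET_SAMPLE_RATE)[0] for path in batch]
        # padding=True pads shorter clips so the batch forms one tensor.
        inputs = processor(audio, sampling_rate=TARGET_SAMPLE_RATE,
                           return_tensors="pt", padding=True)
        with torch.no_grad():
            logits = model(**inputs).logits
        predicted_ids = torch.argmax(logits, dim=-1)
        texts.extend(processor.batch_decode(predicted_ids))
    return texts
```

A call such as `transcribe(["part1.wav", "part2.wav"])` (hypothetical file names) would return one transcript per file. Smaller batch sizes reduce memory use for long recordings.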

Use Cases

Speech transcription
German meeting records
Automatically convert German meeting recordings into text records.
Approximately 87% word accuracy, estimated as 100% minus the 12.77% WER measured on the Common Voice German test set
Voice assistant
Provide speech recognition capabilities for German voice assistants.
Education
Language learning application
Help learners practice German pronunciation and listening.
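
The accuracy figure quoted for meeting transcription is derived from WER, which is the number of word substitutions, deletions, and insertions divided by the number of words in the reference transcript. The sketch below computes it with a word-level edit distance. The function name and the example sentences are invented for illustration, and the text normalization used for the reported 12.77% is not stated here, so this is not a reproduction of that evaluation.

```python
from typing import List


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref: List[str] = reference.split()
    hyp: List[str] = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # previous[j] is the edit distance between the reference prefix so far and hyp[:j].
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref)


# Invented example sentences, not taken from Common Voice.
reference = "das treffen beginnt morgen um zehn uhr"
hypothesis = "das treffen beginnt morgen um zehn"
print(f"WER: {word_error_rate(reference, hypothesis):.2%}")  # prints "WER: 14.29%"
```

Because insertions count as errors, WER can exceed 100%, so "100% minus WER" is only a rough proxy for accuracy.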