Phoneme Test 5 Sv

Developed by patrickvonplaten
A fine-tuned version of facebook/wav2vec2-xls-r-300m, trained on the 10-hour German subset of MULTILINGUAL_LIBRISPEECH for German speech recognition.
Downloads: 17
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for German, obtained by fine-tuning the wav2vec2-xls-r-300m checkpoint on German speech data.

Model Features

German optimization
Fine-tuned specifically for German speech recognition and evaluated on German data
Efficient training
Built on a 300M-parameter base model and fine-tuned on a small dataset (10 hours of speech)
Low word error rate
Achieves a word error rate (WER) of 0.1520 (15.2%) on the evaluation set
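
To make the WER figure concrete: WER is the number of word substitutions, deletions, and insertions needed to turn the model output into the reference transcript, divided by the number of reference words. A value of 0.1520 therefore corresponds to roughly 15 word errors per 100 reference words. The following is a minimal sketch of that calculation, not the evaluation script used for this model; the function name and the German example sentences are illustrative assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in output
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one substitution ("ein" -> "kein") and one deletion
# ("kleiner") against a five-word reference.
example_wer = word_error_rate("das ist ein kleiner test", "das ist kein test")
```

Here `example_wer` is 2 errors over 5 reference words. Real evaluations usually normalize case and punctuation before comparing, which this sketch leaves out.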

Model Capabilities

German speech recognition
Speech-to-text
Multilingual speech processing

Use Cases

Speech transcription
German meeting minutes
Automatically transcribe German meeting recordings into text
Transcription with a word error rate of about 15.2% on the evaluation set
German voice assistant
Used as the speech recognition module for German voice assistants
Education
German learning applications
Help learners practice German pronunciation and listening
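
For the transcription use cases above, a minimal sketch with the Hugging Face transformers library could look like the following. The repository id is an assumption inferred from the model name and author shown on this page, and should be checked before use. The helper name `transcribe` is hypothetical. Passing a file path to the pipeline also requires ffmpeg to be installed for audio decoding.

```python
# Assumed repository id, inferred from the model name and author on this page.
# It is not confirmed by the page itself; replace it with the actual id.
ASSUMED_MODEL_ID = "patrickvonplaten/phoneme_test_5_sv"


def transcribe(audio_path: str, model_id: str = ASSUMED_MODEL_ID) -> str:
    """Return the transcript of a single audio file (hypothetical helper)."""
    # Imported lazily so that defining this function does not require
    # transformers to be installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # chunk_length_s makes the pipeline process long recordings, such as
    # meetings, in 30-second windows instead of one very long input.
    result = asr(audio_path, chunk_length_s=30)
    return result["text"]
```

A call such as `transcribe("meeting.wav")` (with a placeholder file name) would download the model on first use and return the recognized text as a string.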