This is a German speech recognition model based on the wav2vec 2.0 XLS-R 1B architecture, incorporating TEVR (Token Entropy Variance Reduction) technology and combined with a 5-gram language model. It achieves a word error rate of 3.64% on the Common Voice German test set.
Speech Recognition
Transformers German