W

Wav2vec2 Large Xlsr 53 Hungarian

Developed by jonatasgrosman
This is a fine-tuned XLSR-53 large model for Hungarian speech recognition tasks, trained on Common Voice and CSS10 datasets.
Downloads 127.73k
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model fine-tuned on Hungarian based on facebook/wav2vec2-large-xlsr-53, supporting voice input with a 16kHz sampling rate.

Model Features

High-performance Hungarian Recognition
Achieves 31.40% WER and 6.20% CER on the Common Voice Hungarian test set, outperforming similar models
Based on XLSR-53 Architecture
Utilizes a large-scale pre-trained model for cross-lingual speech representation learning through fine-tuning
No Language Model Required
Can be used directly without additional language model support

Model Capabilities

Hungarian speech recognition
16kHz audio processing

Use Cases

Speech-to-Text
Speech Transcription
Convert Hungarian speech content into text
Approximately 93.8% accuracy (based on CER)
Voice Assistants
Hungarian Voice Command Recognition
Used for front-end speech recognition in Hungarian voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase