Wav2vec2 Xls R Phoneme 300m Tr
W
Wav2vec2 Xls R Phoneme 300m Tr
Developed by patrickvonplaten
An automatic speech recognition model fine-tuned on the Turkish Common Voice dataset based on Facebook's wav2vec2-xls-r-300m model
Downloads 16
Release Time : 3/2/2022
Model Overview
This model is an automatic speech recognition (ASR) model optimized for Turkish, focusing on phoneme-level recognition tasks. It achieved a 16.64% phoneme error rate (PER) on the Common Voice Turkish evaluation set.
Model Features
Phoneme-level recognition
Focuses on phoneme-level speech recognition, suitable for applications requiring detailed speech analysis
Turkish language optimization
Specifically fine-tuned for Turkish, performing well on the Common Voice Turkish dataset
Based on XLS-R architecture
Utilizes Facebook's powerful wav2vec2-xls-r-300m architecture as the base model
Model Capabilities
Turkish speech recognition
Phoneme-level analysis
Speech-to-text
Use Cases
Speech transcription
Turkish speech-to-text
Convert Turkish speech content into text
Phoneme error rate 16.64%
Speech analysis
Phoneme research
Used for linguistic research and Turkish phoneme analysis
Featured Recommended AI Models
Š 2025AIbase