Wav2vec2 Xls R 300m Phoneme
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m and specialized for phoneme recognition.
Downloads 12.26k
Release Time : 5/19/2022
Model Overview
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m for phoneme recognition. It achieved a character error rate (CER) of 0.1332 (about 13.3%) on the evaluation set.
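For readers unfamiliar with the metric, the following is a minimal sketch of how a character error rate is commonly computed: total edit distance between predictions and references, divided by total reference length. It is not the evaluation script used for this model; the function names `edit_distance` and `error_rate` are illustrative, and whether the reported 0.1332 counts spaces or treats each phoneme as one symbol is not stated in the source.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference symbol
                curr[j - 1] + 1,     # insert a hypothesis symbol
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses):
    """Total edit distance divided by total reference length."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_len = sum(len(r) for r in references)
    if total_len == 0:
        raise ValueError("references must contain at least one symbol")
    return total_edits / total_len


if __name__ == "__main__":
    # One substitution in a four-symbol reference.
    print(error_rate(["abcd"], ["abxd"]))
```

Because the functions accept any sequence, passing lists of phoneme tokens instead of strings gives a phoneme-level error rate with the same code.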
Model Features
Efficient Phoneme Recognition
Fine-tuned for phoneme recognition, reaching a character error rate (CER) of 0.1332 on the evaluation set
Based on Large-scale Pretrained Model
Fine-tuned from facebook/wav2vec2-xls-r-300m, reusing its pretrained speech feature extraction
Optimized Training Configuration
Uses tuned training hyperparameters, including a learning rate schedule and gradient accumulation
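The two training techniques named above can be sketched in a few lines. This is an illustration only: the source does not state which schedule or which values were used, so the linear warmup-then-decay shape, the function names, and every number in the example are assumptions.

```python
def linear_warmup_decay(step, base_lr, warmup_steps, total_steps):
    """Learning rate at a given step: linear ramp up, then linear decay to zero.

    The schedule shape is an assumption; the source does not name one.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


def effective_batch_size(per_device_batch, accumulation_steps, num_devices=1):
    """Samples contributing to one optimizer update under gradient accumulation."""
    return per_device_batch * accumulation_steps * num_devices


if __name__ == "__main__":
    # Hypothetical values, chosen only to show the shape of the schedule.
    for step in (0, 5, 10, 55, 100):
        print(step, linear_warmup_decay(step, 1e-4, warmup_steps=10, total_steps=100))
    print(effective_batch_size(per_device_batch=8, accumulation_steps=4))
```

Gradient accumulation sums gradients over several small batches before each optimizer step, which gives the effect of a larger batch without the memory cost of holding it all at once.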
Model Capabilities
Speech Recognition
Phoneme Recognition
Audio Feature Extraction
Use Cases
Speech Processing
Speech to Phoneme
Convert speech signals into phoneme sequences
Character error rate (CER): 0.1332 on the evaluation set
Speech Analysis
Used for phoneme analysis in linguistic research
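To make the speech-to-phoneme step concrete, here is a minimal sketch of greedy CTC decoding, the usual way per-frame predictions from a fine-tuned wav2vec2 model are turned into a symbol sequence. It assumes the model has a CTC output head, which the source does not state explicitly; the vocabulary, the blank index, and the frame predictions below are made up for illustration and are not this model's actual vocabulary.

```python
def ctc_greedy_decode(frame_ids, id_to_symbol, blank_id=0):
    """Collapse consecutive repeated ids, then drop blanks (greedy CTC decoding)."""
    symbols = []
    prev = None
    for idx in frame_ids:
        # Emit only when the id changes and is not the blank token.
        if idx != prev and idx != blank_id:
            symbols.append(id_to_symbol[idx])
        prev = idx
    return symbols


if __name__ == "__main__":
    # Hypothetical vocabulary and per-frame argmax ids.
    vocab = {0: "<blank>", 1: "h", 2: "ə", 3: "l", 4: "oʊ"}
    frames = [1, 1, 0, 2, 0, 3, 3, 0, 4, 4]
    print(" ".join(ctc_greedy_decode(frames, vocab)))
```

A blank between two identical ids keeps both symbols, which is how CTC represents a genuinely repeated phoneme.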