Wav2vec2 Large Robust Pronounciation Evaluation
A pronunciation assessment model fine-tuned from facebook/wav2vec2-large-robust for speech quality evaluation tasks
Downloads 242
Release Time: 6/26/2023
Model Overview
This model is fine-tuned from wav2vec2-large-robust to evaluate speech pronunciation quality. Its performance is reported with metrics such as accuracy and F1 score.
Model Features
High-Precision Pronunciation Assessment
Achieves 72% accuracy and F1 score on the test set
Based on wav2vec2-large-robust Architecture
Fine-tuned on top of the pre-trained speech representations of wav2vec2-large-robust
Multi-Metric Evaluation
Evaluated with several metrics, including accuracy, F1 score, precision, and recall
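The metrics named above can be computed from predicted and reference labels along the following lines. This is a minimal sketch, not the model author's evaluation script: the function name classification_metrics, the integer class labels, and the sample data are hypothetical, and macro averaging is an assumption because the source does not say how precision, recall, and F1 were averaged.

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Return accuracy plus macro-averaged precision, recall, and F1.

    Macro averaging is an assumption; the model card does not state
    which averaging scheme was used.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and the same length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gets a false positive
            fn[truth] += 1  # true class gets a false negative

    precisions, recalls, f1s = [], [], []
    for label in labels:
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        pr_sum = precision + recall
        f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Hypothetical labels: three pronunciation-quality classes encoded as 0, 1, 2.
reference = [0, 0, 1, 1, 2, 2]
predicted = [0, 1, 1, 1, 2, 0]
metrics = classification_metrics(reference, predicted)
```

With a fine-tuned classifier, predicted would hold the highest-scoring class for each audio clip and reference the human-assigned labels.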
Model Capabilities
Speech Quality Evaluation
Pronunciation Accuracy Analysis
Speech Feature Extraction
Use Cases
Language Learning
Foreign Language Pronunciation Assessment
Used to evaluate the pronunciation accuracy of foreign language learners
Reaches 72% accuracy on the test set
Speech Quality Detection
Speech Synthesis Quality Evaluation
Evaluates the quality of speech generated by TTS systems