W

Wav2vec2 Large Robust Pronounciation Evaluation

Developed by hafidikhsan
A pronunciation assessment model fine-tuned based on facebook/wav2vec2-large-robust for speech quality evaluation tasks
Downloads 242
Release Time : 6/26/2023

Model Overview

This model is a pronunciation assessment model fine-tuned on the wav2vec2-large-robust architecture, primarily used to evaluate speech pronunciation quality, capable of calculating metrics such as accuracy and F1 score

Model Features

High-Precision Pronunciation Assessment
Achieves 72% accuracy and F1 score on the test set
Based on wav2vec2-large-robust Architecture
Utilizes powerful pre-trained speech representation capabilities for fine-tuning
Multi-Metric Evaluation
Supports various evaluation metrics such as accuracy, F1 score, precision, and recall

Model Capabilities

Speech Quality Evaluation
Pronunciation Accuracy Analysis
Speech Feature Extraction

Use Cases

Language Learning
Foreign Language Pronunciation Assessment
Used to evaluate the pronunciation accuracy of foreign language learners
Can provide an evaluation accuracy of 72%
Speech Quality Detection
Speech Synthesis Quality Evaluation
Evaluates the quality of speech generated by TTS systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase