W

Wav2vec2 Large Lv60 Phoneme Timit English Timit 4k

Developed by excalibur12
English phoneme recognition model fine-tuned from facebook/wav2vec2-large-lv60, achieving a phoneme error rate of 10.53% on the TIMIT dataset
Downloads 306
Release Time : 6/17/2024

Model Overview

This model is optimized for English phoneme recognition tasks, particularly suitable for phoneme-level speech analysis

Model Features

Low phoneme error rate
Achieves a phoneme error rate of 10.53% on the TIMIT test set, demonstrating excellent performance
Detailed phoneme analysis
Provides detailed error analysis for various phoneme categories including vowels, stops, and fricatives
Based on wav2vec2 architecture
Utilizes facebook's advanced wav2vec2-large-lv60 model as the foundation

Model Capabilities

English phoneme recognition
Speech feature extraction
Phoneme-level error analysis

Use Cases

Speech research
Phoneme recognition research
Used for linguistic studies and speech recognition system development
10.53% phoneme error rate
Educational technology
Pronunciation assessment
Can be used for pronunciation accuracy evaluation in language learning applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase