W

Wav2vec2 Lv 60 Espeak Cv Ft

Developed by facebook
This model is based on the pre-trained Wav2Vec2-Large-LV60 model and fine-tuned on the CommonVoice dataset for multilingual phoneme recognition.
Downloads 18.77k
Release Time : 3/2/2022

Model Overview

This model is mainly used for multilingual phoneme recognition tasks and can convert speech input into phoneme labels. It needs to be used with a phoneme-to-word mapping dictionary.

Model Features

Multilingual support
Supports phoneme recognition in multiple languages
Fine-tuned on CommonVoice
Fine-tuned on the CommonVoice dataset to improve recognition accuracy
Phoneme-level recognition
Outputs phoneme labels and needs to be converted to words with a dictionary

Model Capabilities

Speech recognition
Phoneme recognition
Multilingual processing

Use Cases

Speech transcription
Multilingual speech transcription
Convert speech in multiple languages into phoneme labels
Can be further converted into text
Phonetic research
Phoneme analysis
Used to analyze the phoneme distribution and characteristics of different languages
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase