Wav2vec LnNor IPA Ft

Developed by MultiBridge
A phoneme recognition model fine-tuned from wav2vec2-base that converts English speech into International Phonetic Alphabet (IPA) transcriptions
Downloads 16
Release Time: 3/2/2025

Model Overview

This model was fine-tuned on the TIMIT and LnNor datasets for phoneme recognition. Its predictions are written in the International Phonetic Alphabet (IPA).

Model Features

Multi-dataset fine-tuning
Fine-tuned on both the TIMIT and LnNor datasets to improve generalization
IPA output
Outputs International Phonetic Alphabet (IPA) symbols directly, which facilitates phonetic research
Pre-trained feature retention
The encoder is kept frozen during fine-tuning, preserving useful features learned by wav2vec2-base in pre-training

Model Capabilities

English phoneme recognition
Speech to phoneme conversion
Automatic phonetic transcription
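
The page does not describe how per-frame predictions become a phoneme sequence. Fine-tuned wav2vec2 recognizers commonly use a CTC output layer, so the following is a minimal sketch of greedy CTC collapsing under that assumption. The blank symbol `<pad>`, the function name `ctc_greedy_collapse`, and the example frame labels are all illustrative and are not taken from the model itself.

```python
from itertools import groupby
from typing import List, Sequence

# Assumption: the CTC blank token is "<pad>", as in many wav2vec2 vocabularies.
BLANK = "<pad>"


def ctc_greedy_collapse(frame_labels: Sequence[str], blank: str = BLANK) -> List[str]:
    """Turn per-frame labels into a phoneme sequence.

    Step 1: merge runs of identical consecutive labels.
    Step 2: drop the blank symbol.
    """
    merged = [label for label, _ in groupby(frame_labels)]
    return [label for label in merged if label != blank]


# Hypothetical per-frame argmax labels for a short utterance.
frames = ["<pad>", "h", "h", "<pad>", "ɛ", "ɛ", "l", "l", "oʊ", "<pad>"]
phonemes = ctc_greedy_collapse(frames)
print(" ".join(phonemes))
```

Because blanks are removed only after repeats are merged, a blank between two identical labels keeps them as two separate phonemes, which is how a CTC decoder distinguishes a held sound from a repeated one.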

Use Cases

Speech processing
Automatic phonetic transcription
Convert raw speech into phoneme sequences
Speech processing component
Serve as a component or prototype in speech processing pipelines
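
As a rough illustration of using the model as a pipeline component, the sketch below wraps it with the Hugging Face Transformers `pipeline` API. The Hub identifier `MultiBridge/wav2vec-LnNor-IPA-ft` is an assumption inferred from the model and developer names on this page, and `transcribe_to_ipa` is a hypothetical helper; check the actual repository name and its model card before relying on either.

```python
from typing import Any, Callable, Dict, Optional

# Assumption: Hub id inferred from the page title and developer name; verify before use.
MODEL_ID = "MultiBridge/wav2vec-LnNor-IPA-ft"


def transcribe_to_ipa(
    audio_path: str,
    asr: Optional[Callable[[str], Dict[str, Any]]] = None,
) -> str:
    """Return the transcription for one audio file.

    `asr` is any callable mapping a file path to a dict with a "text" key.
    If omitted, a Transformers ASR pipeline is built for MODEL_ID.
    """
    if asr is None:
        # Imported lazily so the helper can be exercised without the library installed.
        from transformers import pipeline

        asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    return asr(audio_path)["text"].strip()
```

Calling `transcribe_to_ipa("sample.wav")` with a local English recording (the file name is a placeholder) would download the model on first use. Passing a prebuilt pipeline as `asr` avoids reloading the model for every file.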