U

Unispeech 1350 En 168 Es Ft 1h

Developed by microsoft
UniSpeech is a unified speech representation learning model that combines labeled and unlabeled data for pre-training, specifically fine-tuned for Spanish phoneme recognition.
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is pre-trained on 16kHz sampled speech audio with phoneme labels and fine-tuned on 1 hour of Spanish phoneme data, primarily designed for phoneme classification tasks.

Model Features

Unified Representation Learning
Performs supervised phoneme CTC learning and phoneme-aware contrastive self-supervised learning simultaneously through multi-task learning.
Cross-lingual Capability
Demonstrates excellent cross-lingual representation learning performance on the CommonVoice corpus.
Strong Domain Adaptability
Achieves a 6% relative reduction in word error rate compared to previous methods in domain adaptation speech recognition tasks.

Model Capabilities

Speech recognition
Phoneme classification
Cross-lingual speech processing

Use Cases

Speech recognition
Spanish phoneme recognition
Converts Spanish speech into phoneme sequences
Compared to self-supervised pre-training and supervised transfer learning, it can reduce relative phoneme error rates by up to 13.4% and 17.8% respectively.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase