U

Unispeech Sat Base Plus Timit Ft

Developed by patrickvonplaten
An automatic speech recognition (ASR) model fine-tuned on the TIMIT_ASR dataset based on microsoft/unispeech-sat-base-plus
Downloads 16
Release Time : 3/2/2022

Model Overview

This neural network model is optimized for English speech recognition tasks, particularly suitable for academic research and speech recognition system development

Model Features

Fine-tuned on TIMIT dataset
Specially fine-tuned on the standard TIMIT speech recognition dataset to optimize English speech recognition performance
Based on UniSpeech-SAT architecture
Utilizes Microsoft's UniSpeech-SAT base model with self-attention mechanisms for speech feature extraction
Progressive optimization training
Gradually reduces word error rate (WER) through 20 training epochs, achieving a final recognition accuracy of 0.4051

Model Capabilities

English speech recognition
Continuous speech-to-text
Speech feature extraction

Use Cases

Academic research
Speech recognition benchmarking
Can serve as a benchmark model on TIMIT dataset for comparative studies
Word error rate 0.4051
Speech technology development
Speech-to-text systems
Used for developing English speech recognition applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase