timit-5percent-supervised: Open-Source Speech Recognition Model Fine-Tuned on 5% of the TIMIT Dataset

Timit 5percent Supervised

Developed by Kuray107

A speech recognition model based on facebook/wav2vec2-large-lv60 and fine-tuned with supervision on 5% of the TIMIT dataset

Task: Speech Recognition

Library: Transformers

Open Source License: Apache-2.0

Tags: #Speech Recognition #Low-resource Fine-tuning #Wav2Vec2 Architecture

Downloads 31

Release Time: 3/2/2022

Model Overview

This is an English speech recognition model. It reaches a word error rate (WER) of 27.88% on its TIMIT evaluation set.

Model Features

Efficient Supervised Learning

Achieves good recognition performance using only 5% of the TIMIT dataset for training

Low Word Error Rate

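Achieves a word error rate (WER) of 27.88% on the evaluation set at the final checkpoint (step 3000); the lowest WER logged during training was 27.12% at step 2000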

Based on wav2vec2 Architecture

Uses facebook/wav2vec2-large-lv60 as the base model, a wav2vec 2.0 network pretrained on unlabeled speech audio that supplies the speech feature extraction
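Because the headline figure above is a word error rate, a small worked example of how WER is commonly computed may help: it is the word-level edit distance (substitutions, deletions and insertions) divided by the number of words in the reference transcript. The function name `word_error_rate` and the example sentences are illustrative assumptions; this is a sketch of the standard definition, not the evaluation script used for this model.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # previous[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = previous[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = previous[j] + 1      # reference word missing from hypothesis
            insertion = current[j - 1] + 1  # extra word in hypothesis
            current.append(min(substitution, deletion, insertion))
        previous = current

    return previous[-1] / len(ref_words)


# Illustrative sentences (not taken from TIMIT): one dropped word out of six.
example_wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
print(f"WER = {example_wer:.4f}")
```

Note that insertions count as errors too, so WER is not capped at 1.0 when the hypothesis is longer than the reference.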

Model Capabilities

English Speech Recognition

Speech-to-Text

Continuous Speech Recognition
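The capabilities listed above could be exercised with the Transformers `pipeline` API, roughly as sketched below. The Hub identifier `Kuray107/timit-5percent-supervised` is an assumption assembled from the developer and model names on this page, and the helper names `build_transcriber` and `transcribe` are hypothetical. Running it for real requires the `transformers` and `torch` packages, plus ffmpeg for decoding audio files.

```python
from typing import Any, Callable

# Assumed Hub identifier (developer name + model name from this page); not confirmed here.
MODEL_ID = "Kuray107/timit-5percent-supervised"


def build_transcriber(model_id: str = MODEL_ID) -> Any:
    """Create an automatic-speech-recognition pipeline for the given checkpoint.

    The import is deferred so this module can be loaded without transformers installed.
    """
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path: str, transcriber: Callable[[str], Any]) -> str:
    """Return the decoded text for one audio file.

    The ASR pipeline returns a dict with a "text" key when called on a file path.
    """
    result = transcriber(audio_path)
    return result["text"]
```

A typical call would be `transcribe("recording.wav", build_transcriber())`, where `recording.wav` is a placeholder path. The base model was pretrained on 16 kHz speech, so audio at that sampling rate is the safest input.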

Use Cases

Speech Transcription

Meeting Minutes Transcription

Automatically transcribe English meeting recordings into text records

Reported result: 27.88% word error rate on the TIMIT evaluation set

Voice Command Recognition

Recognize English voice commands

Training Results

Training Loss	Epoch	Step	Validation Loss	WER
5.3773	33.33	500	2.9693	1.0
1.4746	66.67	1000	0.5050	0.3359
0.1067	100.0	1500	0.5981	0.3054
0.0388	133.33	2000	0.6192	0.2712
0.0244	166.67	2500	0.6392	0.2776
0.018	200.0	3000	0.6615	0.2788
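The log above can be read more easily with a few lines of code. The sketch below copies the six logged rows verbatim and derives three things from them: the checkpoint with the lowest WER, the checkpoint with the lowest validation loss, and the number of optimizer steps per epoch. The names `LogRow` and `ROWS` are illustrative, not part of any training script.

```python
from typing import NamedTuple


class LogRow(NamedTuple):
    train_loss: float
    epoch: float
    step: int
    val_loss: float
    wer: float


# Values copied from the training results table above.
ROWS = [
    LogRow(5.3773, 33.33, 500, 2.9693, 1.0),
    LogRow(1.4746, 66.67, 1000, 0.5050, 0.3359),
    LogRow(0.1067, 100.0, 1500, 0.5981, 0.3054),
    LogRow(0.0388, 133.33, 2000, 0.6192, 0.2712),
    LogRow(0.0244, 166.67, 2500, 0.6392, 0.2776),
    LogRow(0.018, 200.0, 3000, 0.6615, 0.2788),
]

best_wer = min(ROWS, key=lambda row: row.wer)
best_val = min(ROWS, key=lambda row: row.val_loss)
# Epoch values are rounded in the log, so round the ratio to the nearest integer.
steps_per_epoch = {round(row.step / row.epoch) for row in ROWS}

print(f"lowest WER: {best_wer.wer:.4f} at step {best_wer.step}")
print(f"lowest validation loss: {best_val.val_loss:.4f} at step {best_val.step}")
print(f"steps per epoch: {sorted(steps_per_epoch)}")
```

Read this way, the log shows the lowest WER at step 2000 and the lowest validation loss at step 1000. Training loss keeps falling through step 3000, which is a pattern consistent with overfitting on a small training subset.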
