wav2vec2-base-960h-timit-demo-colab Open-source Speech Recognition Model, Accurately Identify Speech Content!

Wav2vec2 Base 960h Timit Demo Colab

Developed by obokkkk

A speech recognition model fine-tuned based on facebook/wav2vec2-base-960h, achieving a 21.6% word error rate on the TIMIT dataset

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech Recognition #Fine-tuned Model #Low Word Error Rate

Downloads 20

Release Time : 4/22/2022

Model Overview

This is an automatic speech recognition (ASR) model for English speech recognition, fine-tuned based on the wav2vec2 architecture, suitable for speech-to-text tasks

Model Features

High Accuracy Speech Recognition

Achieves a 21.6% word error rate on the TIMIT evaluation set

Based on wav2vec2 Architecture

Utilizes powerful speech representation capabilities from self-supervised pre-training

Lightweight Model

The base version is relatively lightweight, suitable for deployment in various environments

Model Capabilities

English Speech Recognition

Speech-to-Text

Audio Content Transcription

Use Cases

Speech Transcription

Automated Meeting Minutes

Automatically convert English meeting recordings into text transcripts

Can achieve approximately 80% accuracy

Voice Command Recognition

Recognize user voice commands and convert them into executable commands

Education

Pronunciation Assessment

Analyze the pronunciation accuracy of English learners

Training Loss	Epoch	Step	Validation Loss	Wer
5.7805	4.0	500	3.0558	1.0
2.2936	8.0	1000	0.2937	0.3479
0.4155	12.0	1500	0.2108	0.2473
0.2439	16.0	2000	0.2313	0.2391
0.1617	20.0	2500	0.2003	0.2255
0.1443	24.0	3000	0.2175	0.2207
0.119	28.0	3500	0.2002	0.2160

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base 960h Timit Demo Colab

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-base-960h-timit-demo-colab

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License