Wav2Vec2 Speech Recognition Model: Open-Source, Free, and Trained on LibriSpeech

Wav2vec2 2 Bert Large No Adapter Frozen Enc

Developed by speech-seq2seq

A speech recognition model trained on the librispeech_asr dataset, with a reported word error rate (WER) of 2.0133 on the evaluation set.

Speech Recognition

Transformers

#High-precision speech transcription #Low word error rate #English speech recognition

Downloads 25

Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model capable of converting speech to text. The model is trained on the librispeech_asr dataset and is suitable for English speech recognition tasks.

Model Features

Low word error rate

Reports a word error rate (WER) of 2.0133 on the evaluation set, the value logged at the final training step (5000). The source does not state whether this figure is a fraction or a percentage.
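For readers unfamiliar with the metric, here is a minimal sketch of how WER is conventionally defined: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. This is an illustration of the standard definition only, not the evaluation script used for this model, and the function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat", "the cat sat"))  # identical transcripts
    print(word_error_rate("the cat sat", "the bat sat"))  # one substitution in three words
    print(word_error_rate("the cat", "a a a a a a"))      # many insertions push WER above 1
```

Because insertions count as errors, WER expressed as a fraction is not capped at 1.0, which is worth keeping in mind when reading any reported WER value.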

Trained on LibriSpeech

Trained on librispeech_asr, a standard dataset of read English speech.

Optimized training configuration

Training uses gradient accumulation, learning rate warm-up, and mixed-precision training.
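As a rough illustration of two of these techniques, the sketch below shows a linear learning rate warm-up and the effective batch size that results from gradient accumulation. The page gives no hyperparameter values, so every number here is a made-up placeholder, and the function names `linear_warmup_lr` and `effective_batch_size` are hypothetical. Mixed-precision training depends on the framework and hardware and is not shown.

```python
def linear_warmup_lr(step: int, base_lr: float, warmup_steps: int) -> float:
    """Ramp the learning rate linearly from 0 to base_lr over warmup_steps."""
    if warmup_steps <= 0 or step >= warmup_steps:
        return base_lr
    return base_lr * step / warmup_steps


def effective_batch_size(per_device_batch: int,
                         accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Samples contributing to one optimizer update under gradient accumulation."""
    return per_device_batch * accumulation_steps * num_devices


if __name__ == "__main__":
    # Placeholder values for illustration only; not taken from the model card.
    base_lr, warmup_steps = 1e-4, 500
    for step in (0, 250, 500, 1000):
        print(step, linear_warmup_lr(step, base_lr, warmup_steps))
    print(effective_batch_size(per_device_batch=8, accumulation_steps=4, num_devices=2))
```

Gradient accumulation sums gradients over several small batches before each optimizer update, which lets a memory-limited device emulate a larger batch.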

Model Capabilities

English speech recognition

Speech-to-text conversion

Use Cases

Speech transcription

Audio transcription

Convert English speech content into text

Word error rate 2.0133

Assistive tools

Subtitle generation

Automatically generate subtitles for English video content
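A subtitle workflow needs timed text in addition to the transcript. The sketch below formats already-timed segments as SubRip (SRT) text. It assumes the segment start and end times come from some other step, such as fixed-length audio chunking; the page does not say that this model produces timestamps. The names `srt_timestamp` and `to_srt` are hypothetical.

```python
from typing import Iterable, Tuple

Segment = Tuple[float, float, str]  # (start seconds, end seconds, transcript text)


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as HH:MM:SS,mmm, the SRT timestamp layout."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}"
        )
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    # Example segments are invented for illustration.
    print(to_srt([(0.0, 2.5, "hello world"), (2.5, 5.0, "this is a test")]))
```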

Training Results

Training Loss	Epoch	Step	Validation Loss	WER
5.171	0.28	500	8.6956	2.0055
5.307	0.56	1000	8.5958	2.0096
5.1449	0.84	1500	10.4208	2.0115
6.1351	1.12	2000	10.2950	2.0059
6.2997	1.4	2500	10.6762	2.0115
6.1394	1.68	3000	10.9190	2.0110
6.1868	1.96	3500	11.0166	2.0112
5.9647	2.24	4000	11.4154	2.0141
6.2202	2.52	4500	11.5837	2.0152
5.9612	2.8	5000	11.7664	2.0133
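To make the trend in the table easier to inspect, here is a small sketch that loads the rows above and finds the checkpoints with the lowest validation loss and the lowest WER. The values are copied from the table; the `LogRow` type and variable names are hypothetical.

```python
from typing import NamedTuple


class LogRow(NamedTuple):
    train_loss: float
    epoch: float
    step: int
    val_loss: float
    wer: float


# Rows copied from the training results table.
ROWS = [
    LogRow(5.171, 0.28, 500, 8.6956, 2.0055),
    LogRow(5.307, 0.56, 1000, 8.5958, 2.0096),
    LogRow(5.1449, 0.84, 1500, 10.4208, 2.0115),
    LogRow(6.1351, 1.12, 2000, 10.2950, 2.0059),
    LogRow(6.2997, 1.4, 2500, 10.6762, 2.0115),
    LogRow(6.1394, 1.68, 3000, 10.9190, 2.0110),
    LogRow(6.1868, 1.96, 3500, 11.0166, 2.0112),
    LogRow(5.9647, 2.24, 4000, 11.4154, 2.0141),
    LogRow(6.2202, 2.52, 4500, 11.5837, 2.0152),
    LogRow(5.9612, 2.8, 5000, 11.7664, 2.0133),
]

best_val = min(ROWS, key=lambda row: row.val_loss)
best_wer = min(ROWS, key=lambda row: row.wer)
final = ROWS[-1]

if __name__ == "__main__":
    print(f"lowest validation loss: {best_val.val_loss} at step {best_val.step}")
    print(f"lowest WER: {best_wer.wer} at step {best_wer.step}")
    print(f"final step {final.step}: validation loss {final.val_loss}, WER {final.wer}")
```

In these rows the validation loss is lowest at step 1000 and higher at every later step, while the WER stays between 2.0055 and 2.0152 throughout.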
