WavLM-Libri-Clean-100h-Base-Plus Open-Source Automatic Speech Recognition Model

Wavlm Libri Clean 100h Base Plus

Developed by patrickvonplaten

An automatic speech recognition model based on microsoft/wavlm-base-plus and fine-tuned on the LIBRISPEECH_ASR - CLEAN dataset

Speech Recognition

Transformers

#High-precision speech recognition #LibriSpeech optimized #Low word error rate

Downloads 126.17k

Release Time: 3/2/2022

Model Overview

This model is a WavLM model fine-tuned for English speech recognition on the LibriSpeech clean-100h dataset. It reaches a word error rate (WER) of 0.0683 on the evaluation set.

Model Features

Efficient Fine-tuning

Fine-tuned from the pre-trained WavLM-base-plus model, reusing the speech feature extraction it learned during pre-training

Low Word Error Rate

Achieves a word error rate (WER) of 0.0683 (6.83%) on the evaluation set
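For readers unfamiliar with the metric, WER is conventionally the word-level edit distance (substitutions, deletions, insertions) between the reference transcript and the model output, divided by the number of reference words. The following is a minimal pure-Python sketch of that definition, not the evaluation script used for this model; the function name and the example sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (previous row) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical sentences for illustration only (not taken from LibriSpeech).
reference = "the quick brown fox jumps over the lazy dog"
hypothesis = "the quick brown fox jumped over lazy dog"
wer = word_error_rate(reference, hypothesis)
print(f"WER = {wer:.4f}")
```

Under this definition, a WER of 0.0683 corresponds to roughly 7 word errors per 100 reference words.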

Multi-GPU Training Optimization

Trained in parallel on 8 GPUs with a total batch size of 32
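The per-GPU batch size implied by these figures can be worked out with a short calculation. This sketch assumes data-parallel training with no gradient accumulation, which the text does not state; the helper name is hypothetical.

```python
NUM_GPUS = 8            # stated in the text
TOTAL_BATCH_SIZE = 32   # stated in the text
GRAD_ACCUM_STEPS = 1    # assumption: gradient accumulation is not mentioned


def per_device_batch_size(total: int, num_devices: int, grad_accum_steps: int = 1) -> int:
    """Split a total batch size evenly across devices and accumulation steps."""
    per_device, remainder = divmod(total, num_devices * grad_accum_steps)
    if remainder:
        raise ValueError("total batch size is not evenly divisible")
    return per_device


batch_per_gpu = per_device_batch_size(TOTAL_BATCH_SIZE, NUM_GPUS, GRAD_ACCUM_STEPS)
print(f"Per-GPU batch size: {batch_per_gpu}")
```

If gradient accumulation was in fact used, the per-GPU batch would be smaller by that factor.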

Model Capabilities

English speech recognition

Continuous speech-to-text

High-accuracy transcription
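As an illustration of the capabilities listed above, the sketch below shows one way the model might be loaded for inference through the Transformers `pipeline` API. The Hub identifier is an assumption pieced together from the developer and model names on this page, and `sample.wav` is a placeholder path. Running it requires the `transformers` and `torch` packages, plus `ffmpeg` for decoding audio files.

```python
# Assumed Hub identifier, inferred from the developer and model names on this page.
MODEL_ID = "patrickvonplaten/wavlm-libri-clean-100h-base-plus"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe an English audio file and return the recognized text."""
    # Imported lazily so that defining this function does not require transformers.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict containing a "text" field
    return result["text"]


# Example call ("sample.wav" is a placeholder, not a file shipped with the model):
# print(transcribe("sample.wav"))
```

Building the pipeline inside the function keeps the sketch short; for repeated calls it would be cheaper to create the pipeline once and reuse it.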

Use Cases

Speech Transcription

Audiobook Transcription

Automatically transcribe English audiobook content into text

Achieves a 6.83% word error rate on the LibriSpeech evaluation set

Meeting Minutes

Automatically convert English meeting recordings into written transcripts

Training Results

Training Loss	Epoch	Step	Validation Loss	WER
2.8877	0.34	300	2.8649	1.0
0.2852	0.67	600	0.2196	0.1830
0.1198	1.01	900	0.1438	0.1273
0.0906	1.35	1200	0.1145	0.1035
0.0729	1.68	1500	0.1055	0.0955
0.0605	2.02	1800	0.0936	0.0859
0.0402	2.35	2100	0.0885	0.0746
0.0421	2.69	2400	0.0848	0.0700
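The logged checkpoints above can be inspected programmatically. The sketch below copies the rows verbatim, finds the checkpoint with the lowest WER, and estimates the number of optimizer steps per epoch from the Step and Epoch columns. The variable names are illustrative, and the steps-per-epoch figure is only an estimate because the Epoch column is rounded to two decimals.

```python
# Rows copied from the table: training loss, epoch, step, validation loss, WER.
TRAINING_LOG = """\
2.8877 0.34 300 2.8649 1.0
0.2852 0.67 600 0.2196 0.1830
0.1198 1.01 900 0.1438 0.1273
0.0906 1.35 1200 0.1145 0.1035
0.0729 1.68 1500 0.1055 0.0955
0.0605 2.02 1800 0.0936 0.0859
0.0402 2.35 2100 0.0885 0.0746
0.0421 2.69 2400 0.0848 0.0700
"""

rows = []
for line in TRAINING_LOG.strip().splitlines():
    train_loss, epoch, step, val_loss, wer = line.split()
    rows.append({
        "train_loss": float(train_loss),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "wer": float(wer),
    })

# Checkpoint with the lowest validation WER among the logged steps.
best = min(rows, key=lambda r: r["wer"])

# True if WER drops at every logged checkpoint.
wer_always_improves = all(a["wer"] > b["wer"] for a, b in zip(rows, rows[1:]))

# Rough steps-per-epoch estimate, averaged over all logged rows.
steps_per_epoch = sum(r["step"] / r["epoch"] for r in rows) / len(rows)

print(f"Best logged WER: {best['wer']:.4f} at step {best['step']}")
print(f"WER improves at every checkpoint: {wer_always_improves}")
print(f"Estimated steps per epoch: {steps_per_epoch:.0f}")
```

The last logged checkpoint has a WER of 0.0700, slightly above the 0.0683 reported for the final model, which suggests the reported figure comes from an evaluation after the last logged step.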

Property	Details
Model Type	wavlm-libri-clean-100h-base-plus
Training Data	LIBRISPEECH_ASR - CLEAN dataset

