WavLM-Libri-Clean-100h-Base Open-Source Automatic Speech Recognition Model - Accurate Recognition for Easier Speech Processing

WavLM Libri Clean 100h Base

Developed by patrickvonplaten

An automatic speech recognition model fine-tuned from microsoft/wavlm-base on the LIBRISPEECH_ASR - CLEAN dataset

Speech Recognition

Transformers

#High-precision speech recognition #LibriSpeech optimization #WavLM architecture

Downloads 6,515

Release Time: 3/2/2022

Model Overview

This model is a WavLM base checkpoint fine-tuned for English speech recognition on 100 hours of clean LibriSpeech speech data. It reaches a word error rate (WER) of 0.0675 on the evaluation set.

Model Features

Efficient fine-tuning

Fine-tuned on 100 hours of clean speech data; over the course of training, the validation WER fell from 1.0 at step 300 to 0.0698 at step 2400

Low word error rate

Achieved a word error rate (WER) of 0.0675 on the evaluation set, i.e. about 6.75 word errors per 100 reference words

Multi-GPU training

Trained with distributed training across 8 GPUs
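
The WER quoted in the features above is the word-level edit distance between the reference and the model output, divided by the number of reference words. The following is a minimal sketch of that calculation in plain Python; the function name word_error_rate and the example sentences are illustrative and not taken from the model card, and evaluation scripts usually also normalize text before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the) over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
    # Word accuracy implied by the reported WER of 0.0675.
    print(f"{1 - 0.0675:.2%}")
```

A WER of 0.0675 therefore corresponds to a word accuracy of 1 - 0.0675 = 0.9325, which is where the 93.25% figure comes from.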

Model Capabilities

English speech recognition

Continuous speech to text

High-accuracy transcription
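
As a rough illustration of these capabilities, the sketch below transcribes an audio file with the Hugging Face Transformers pipeline API. The Hub id is an assumption assembled from the developer and model names on this page, and the helper name transcribe is hypothetical. Decoding a file path also requires ffmpeg to be installed.

```python
# Assumed Hub id: developer name + model name as listed on this page.
MODEL_ID = "patrickvonplaten/wavlm-libri-clean-100h-base"


def transcribe(audio_path: str) -> str:
    """Return the transcript of a single audio file (hypothetical helper)."""
    # Imported lazily so this module loads even without transformers installed.
    from transformers import pipeline

    # The pipeline downloads the checkpoint on first use and resamples
    # the decoded audio to the sampling rate the model expects.
    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    return asr(audio_path)["text"]


if __name__ == "__main__":
    import sys

    print(transcribe(sys.argv[1]))
```

Building the pipeline once and reusing it across files avoids reloading the weights on every call; the helper above rebuilds it each time only to keep the example short.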

Use Cases

Speech transcription

Automatic meeting minutes generation

Automatically convert meeting recordings into text transcripts

Word accuracy approximately 93.25% (computed as 1 - WER, using the WER of 0.0675 measured on the LibriSpeech clean evaluation set)

Podcast content indexing

Generate searchable text content for audio podcasts

Assistive technology

Real-time caption generation

Provide real-time captions for video or live streaming content

Training Loss	Epoch	Step	Validation Loss	WER
2.8805	0.34	300	2.8686	1.0
0.2459	0.67	600	0.1858	0.1554
0.1114	1.01	900	0.1379	0.1191
0.0867	1.35	1200	0.1130	0.0961
0.0698	1.68	1500	0.1032	0.0877
0.0663	2.02	1800	0.0959	0.0785
0.0451	2.35	2100	0.0887	0.0748
0.0392	2.69	2400	0.0859	0.0698
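
To read the table above more easily, the sketch below picks the logged checkpoint with the lowest validation WER and computes the relative WER reduction between two logged steps. The step and WER values are copied from the table; the names EVAL_LOG and relative_wer_reduction are illustrative.

```python
# (step, validation WER) pairs copied from the training results table.
EVAL_LOG = [
    (300, 1.0),
    (600, 0.1554),
    (900, 0.1191),
    (1200, 0.0961),
    (1500, 0.0877),
    (1800, 0.0785),
    (2100, 0.0748),
    (2400, 0.0698),
]


def relative_wer_reduction(before: float, after: float) -> float:
    """Fraction of the earlier WER that was removed by further training."""
    return (before - after) / before


# Logged checkpoint with the lowest validation WER.
best_step, best_wer = min(EVAL_LOG, key=lambda row: row[1])

if __name__ == "__main__":
    wer_by_step = dict(EVAL_LOG)
    gain = relative_wer_reduction(wer_by_step[600], wer_by_step[2400])
    print(f"best logged step: {best_step} (WER {best_wer})")
    print(f"relative WER reduction, step 600 to 2400: {gain:.1%}")
```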
