WavLM-Libri-Clean-100h-Large Open-source Automatic Speech Recognition Model - Free Deployment for Accurate Speech Content Recognition

Home

Wavlm Libri Clean 100h Large

Developed by patrickvonplaten

Automatic speech recognition model fine-tuned on the LIBRISPEECH_ASR - CLEAN dataset based on microsoft/wavlm-large

Speech Recognition

Transformers

#High-precision speech recognition #LibriSpeech fine-tuning #Multi-GPU training

Downloads 8,171

Release Time : 3/2/2022

Model Overview

This model is a fine-tuned version of the WavLM-Large architecture on the LibriSpeech clean-100h dataset, focusing on English speech recognition tasks, achieving a low word error rate (WER) on the evaluation set.

Model Features

High-performance speech recognition

After fine-tuning on the LibriSpeech clean-100h dataset, the word error rate (WER) is as low as 0.0491

Based on WavLM-Large architecture

Uses Microsoft's WavLM-Large pre-trained model as the foundation, with powerful speech feature extraction capabilities

Multi-GPU training optimization

Uses 8 GPUs for distributed training, optimizing training efficiency through techniques like gradient accumulation

Model Capabilities

English speech recognition

High-precision speech-to-text

Continuous speech recognition

Use Cases

Speech transcription

Audiobook transcription

Automatically transcribes English audiobook content into text

Word error rate of 4.91% on the LibriSpeech evaluation set

Voice assistants

Voice command recognition

Used for English voice command recognition in smart devices

Training Loss	Epoch	Step	Validation Loss	Wer
0.8069	0.34	300	0.7510	0.5809
0.2483	0.67	600	0.2023	0.1929
0.1033	1.01	900	0.1123	0.1028
0.0742	1.35	1200	0.0858	0.0771
0.057	1.68	1500	0.0722	0.0663
0.0421	2.02	1800	0.0682	0.0582
0.0839	2.35	2100	0.0630	0.0534
0.0307	2.69	2400	0.0603	0.0508

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wavlm Libri Clean 100h Large

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wavlm-libri-clean-100h-large

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions