Wavlm Libri Clean 100h Base Plus
An automatic speech recognition model fine-tuned on the LIBRISPEECH_ASR - CLEAN dataset based on microsoft/wavlm-base-plus
Downloads 126.17k
Release Time : 3/2/2022
Model Overview
This model is an optimized WavLM model for English speech recognition tasks, fine-tuned on the LibriSpeech clean-100h dataset, achieving a low word error rate (WER).
Model Features
Efficient Fine-tuning
Fine-tuned based on the pre-trained WavLM-base-plus model, fully leveraging the powerful feature extraction capabilities of the pre-trained model
Low Word Error Rate
Achieved a word error rate (WER) of 0.0683 on the evaluation set, demonstrating excellent performance
Multi-GPU Training Optimization
Utilized 8-GPU parallel training with a total batch size of 32, ensuring high training efficiency
Model Capabilities
English speech recognition
Continuous speech-to-text
High-accuracy transcription
Use Cases
Speech Transcription
Audiobook Transcription
Automatically transcribe English audiobook content into text
Achieved a 6.83% word error rate on the LibriSpeech dataset
Meeting Minutes
Automatically convert English meeting recordings into written transcripts
Featured Recommended AI Models