Wavlm Base En
An English automatic speech recognition (ASR) model fine-tuned from microsoft/wavlm-base on the english_ASR - CLEAN dataset, reaching a word error rate (WER) of 0.0773 on its evaluation set.
Downloads 17
Release Time: 3/2/2022
Model Overview
This is a WavLM base model fine-tuned for English speech recognition, intended for English speech-to-text applications where low transcription error matters.
Model Features
Low Word Error Rate
Achieves a word error rate (WER) of 0.0773 on the evaluation set, that is, roughly 7.7 word errors per 100 reference words.
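For readers unfamiliar with the metric, WER is conventionally defined as (substitutions + deletions + insertions) divided by the number of words in the reference transcript. The sketch below computes it with a word-level edit distance. It illustrates the standard definition only; the page does not say which tool or text normalization produced the 0.0773 figure, and the function name `word_error_rate` and the example sentences are made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative helper (not from the model card). Splits on whitespace
    and applies no case or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution ("sat" -> "sit") and one
# deletion ("the") against a six-word reference.
example_wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

Under this definition, a WER of 0.0773 means the total edit count is about 7.73% of the number of reference words.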
Based on WavLM Architecture
Fine-tuned from Microsoft's WavLM-base model, inheriting its powerful speech representation capabilities.
Optimized Training
Trained with tuned hyperparameters and a linear learning rate schedule.
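A linear learning rate schedule usually means the rate decreases in a straight line from its peak value to zero over the course of training, optionally after a short linear warmup. The sketch below shows that shape in plain Python. The peak rate, step counts, and warmup length are placeholders, since the page does not list the actual training hyperparameters, and `linear_lr` is an illustrative name rather than part of any library.

```python
def linear_lr(step: int, total_steps: int, peak_lr: float,
              warmup_steps: int = 0) -> float:
    """Learning rate at `step` under linear warmup followed by linear decay.

    Illustrative sketch; all argument values are assumptions, not the
    settings used to train this model.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    step = min(max(step, 0), total_steps)  # clamp into [0, total_steps]

    # Warmup phase: ramp linearly from 0 up to peak_lr.
    if warmup_steps > 0 and step < warmup_steps:
        return peak_lr * step / warmup_steps

    # Decay phase: fall linearly from peak_lr to 0 at total_steps.
    decay_steps = max(total_steps - warmup_steps, 1)
    return peak_lr * (total_steps - step) / decay_steps


# Hypothetical settings: 1,000 steps, peak rate 1e-4, 100 warmup steps.
schedule = [linear_lr(s, 1000, 1e-4, warmup_steps=100) for s in range(0, 1001, 100)]
```

With these placeholder values the rate rises to its peak at step 100 and then falls by the same amount every 100 steps until it reaches zero at step 1,000.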
Model Capabilities
English Speech Recognition
High-precision Speech-to-Text
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert English meeting recordings into text transcripts
Highly accurate transcription results
Subtitle Generation
Automatically generate subtitles for English video content
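For the subtitle use case, recognized text has to be paired with timing information and written in a subtitle format. The sketch below formats already-transcribed segments as SubRip (SRT) text. It assumes that some upstream step has produced `(start_seconds, end_seconds, text)` tuples; how those timestamps are obtained from this model is not described on the page, and the helper names are illustrative.

```python
from typing import Iterable, Tuple

Segment = Tuple[float, float, str]  # (start_seconds, end_seconds, text)


def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Segment]) -> str:
    """Turn timed transcript segments into SRT-formatted text."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}"
        )
    # SRT cues are separated by a blank line.
    return "\n\n".join(blocks) + "\n"


# Hypothetical segments standing in for ASR output with timestamps.
srt_text = segments_to_srt([
    (0.0, 1.5, "hello world"),
    (1.5, 4.25, "this is a subtitle"),
])
```

The resulting string can be saved with a `.srt` extension and loaded by most video players alongside the video file.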
Voice Assistants
Voice Command Recognition
Recognize and understand English voice commands