English ASR
E
English ASR
Developed by maher13
This model is a fine-tuned English Automatic Speech Recognition (ASR) model based on facebook/wav2vec2-base, achieving a word error rate of 0.3397 on the evaluation set.
Downloads 13
Release Time : 3/2/2022
Model Overview
This is a model for English speech recognition, capable of converting English speech into text.
Model Features
Low Word Error Rate
Achieved a word error rate of 0.3397 on the evaluation set, demonstrating good performance.
Based on wav2vec2 Architecture
Fine-tuned using facebook's wav2vec2-base model, inheriting its excellent speech feature extraction capabilities.
Efficient Training
Utilizes mixed-precision training (native AMP) and a linear learning rate scheduler for high training efficiency.
Model Capabilities
English Speech Recognition
Speech-to-Text
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert English meeting recordings into written transcripts
Approximately 66.03% accuracy (based on 1-0.3397 word error rate)
Voice Notes
Convert English voice notes into searchable text
Assistive Tools
Subtitle Generation
Automatically generate subtitles for English video content
Featured Recommended AI Models
Š 2025AIbase