Sew D Mid 400k Librispeech Clean 100h Ft
This model is an automatic speech recognition model fine-tuned from asapp/sew-d-mid-400k on the LIBRISPEECH_ASR - CLEAN dataset, achieving a word error rate (WER) of 1.0536 on the evaluation set.
Downloads 15
Release Time: 3/2/2022
Model Overview
A model optimized for English speech recognition tasks, particularly suitable for clean speech samples in the LibriSpeech dataset.
Model Features
Efficient Speech Recognition
Built on the SEW-D (Squeezed and Efficient Wav2vec with Disentangled attention) architecture, which targets efficient speech-to-text.
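Reported Word Error Rate
Reports a WER of 1.0536 on the evaluation set after fine-tuning on the LibriSpeech clean 100h subset. The source does not state whether this figure is a fraction or a percentage.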
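To make the WER figure easier to interpret, here is a minimal sketch of how word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is illustrative and is not taken from the model's evaluation code, which may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a fraction: word-level edit distance / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat", "the dog sat"))
    print(word_error_rate("hello", "hello there world friend"))
```

Because insertions count as errors, a WER expressed as a fraction can exceed 1.0 when the hypothesis contains many extra words, so the unit of a reported figure matters when comparing models.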
Multi-GPU Training Optimization
Supports distributed training, optimized for performance in multi-GPU environments.
Model Capabilities
English Speech Recognition
High-Accuracy Transcription
Processing Clean Speech Samples
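The capabilities above could be exercised with the Hugging Face `transformers` speech recognition pipeline. The following is a sketch only: this page does not give the model's hub identifier, so `model_id` is a required argument that the caller must supply, and the helper names `build_asr_input` and `transcribe` are hypothetical. It assumes 16 kHz mono audio, which is the sampling rate the SEW-D base checkpoints were pretrained on.

```python
EXPECTED_SAMPLING_RATE = 16_000  # assumption: 16 kHz input, as for the SEW-D base checkpoints


def build_asr_input(samples, sampling_rate):
    """Check the sampling rate and package audio in the pipeline's dict format."""
    if sampling_rate != EXPECTED_SAMPLING_RATE:
        raise ValueError(
            f"expected {EXPECTED_SAMPLING_RATE} Hz audio, got {sampling_rate} Hz; "
            "resample before transcribing"
        )
    return {"raw": samples, "sampling_rate": sampling_rate}


def transcribe(samples, sampling_rate, model_id):
    """Transcribe mono audio samples with the checkpoint named by model_id.

    model_id is NOT given on this page; pass the hub identifier of the
    fine-tuned checkpoint you intend to use.
    """
    # Imported lazily so build_asr_input works without these packages installed.
    import numpy as np
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    audio = build_asr_input(np.asarray(samples, dtype=np.float32), sampling_rate)
    return asr(audio)["text"]
```

Rejecting audio at other sampling rates is a deliberate choice in this sketch: it makes a mismatch visible instead of relying on implicit resampling.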
Use Cases
Speech Transcription
Audiobook Transcription
Convert high-quality audiobook content into text.
Expected result: highly accurate transcriptions.
Meeting Minutes
Transcribe meeting speech recorded in quiet environments.