Sew D Mid 400k Librispeech Clean 100h Ft

Developed by patrickvonplaten
This automatic speech recognition model is fine-tuned from asapp/sew-d-mid-400k on the LIBRISPEECH_ASR - CLEAN dataset. It reports a word error rate (WER) of 1.0536 on the evaluation set.
Downloads 15
Release Time: 3/2/2022

Model Overview

An English speech recognition model, best suited to clean speech such as the samples in the LibriSpeech dataset.

Model Features

Efficient Speech Recognition
Built on the SEW-D architecture, which provides efficient speech-to-text conversion.
Low Word Error Rate
Reports a WER of 1.0536 on the evaluation set after fine-tuning on the LibriSpeech clean 100h subset.
Multi-GPU Training Optimization
Supports distributed training, optimized for performance in multi-GPU environments.
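
To make the WER figure above easier to interpret, here is a minimal sketch of how word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. Because insertions count as errors, the ratio can exceed 1.0. The function name `word_error_rate` is illustrative and is not taken from the model's evaluation code, which may also normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, `word_error_rate("the cat sat", "the bat sat")` counts one substitution over three reference words.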

Model Capabilities

English Speech Recognition
High-Accuracy Transcription
Processing Clean Speech Samples

Use Cases

Speech Transcription
Audiobook Transcription
Convert high-quality audiobook content into text.
Highly accurate transcription results.
Meeting Minutes
Record meeting speech in quiet environments.
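
For the transcription use cases above, the following is a hedged sketch of running inference with the Hugging Face Transformers library. It assumes the checkpoint is published as `patrickvonplaten/sew-d-mid-400k-librispeech-clean-100h-ft` (inferred from the model name and developer listed on this page), that it was fine-tuned with a CTC head, and that it ships a processor configuration. The `transcribe` helper is hypothetical.

```python
# Assumed Hub identifier, inferred from the model name and developer.
MODEL_ID = "patrickvonplaten/sew-d-mid-400k-librispeech-clean-100h-ft"


def transcribe(waveform, sampling_rate=16_000, model_id=MODEL_ID):
    """Greedy CTC decoding of a mono waveform (1-D float array)."""
    # Imported here so the module can be loaded without the heavy
    # dependencies installed.
    import torch
    from transformers import AutoModelForCTC, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForCTC.from_pretrained(model_id)
    model.eval()

    # The processor normalizes the audio and returns a batch of size 1.
    inputs = processor(
        waveform, sampling_rate=sampling_rate, return_tensors="pt"
    )
    with torch.no_grad():
        logits = model(inputs.input_values).logits

    # Pick the most likely token per frame; batch_decode collapses
    # repeated tokens and removes CTC blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

LibriSpeech audio is sampled at 16 kHz, so recordings at other rates would need resampling before being passed to `transcribe`.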