Wsj0 Full Supervised
W
Wsj0 Full Supervised
Developed by Kuray107
This model is a speech recognition model fine-tuned on the WSJ0 dataset based on facebook/wav2vec2-large-lv60, achieving a word error rate of 0.0343 on the evaluation set.
Downloads 26
Release Time : 3/2/2022
Model Overview
A supervised learning model optimized for English speech recognition tasks, fine-tuned based on the wav2vec2 architecture.
Model Features
Low Word Error Rate
Achieves a word error rate of 3.43% on the evaluation set, demonstrating excellent performance.
Based on wav2vec2 Architecture
Uses facebook's wav2vec2-large-lv60 as the base model.
Supervised Fine-tuning
Fine-tuned with supervised training on the WSJ0 dataset.
Model Capabilities
English Speech Recognition
Audio to Text Conversion
Use Cases
Speech Transcription
Meeting Minutes Transcription
Automatically converts English meeting recordings into text transcripts.
Highly accurate transcription results
Voice Memo Conversion
Converts voice memos into searchable text.
Featured Recommended AI Models
Š 2025AIbase