Wav2vec2 2 Gpt2 Regularisation
Developed by sanchit-gandhi
This is an automatic speech recognition (ASR) model trained on the LibriSpeech dataset, capable of converting English speech into text.
Downloads 20
Release Time: 3/17/2022
Model Overview
This model is an automatic speech recognition model trained from scratch on the LibriSpeech ASR dataset, primarily used for English speech-to-text tasks.
Model Features
Reported Accuracy
Evaluated by word error rate (WER) on the LibriSpeech evaluation set; the reported WER of approximately 0.9977 is high, not low
End-to-End Training
The model is trained from scratch and does not rely on pre-trained models
Optimized Training
Uses the Adam optimizer and a linear learning rate scheduler for training
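To make the "linear learning rate scheduler" concrete, here is a minimal sketch of one common form of linear schedule: linear warmup followed by linear decay to zero. The listing does not say whether warmup was used, nor does it give the base learning rate or step counts, so the function name `linear_lr` and every number below are hypothetical and chosen only for illustration. The Adam update itself is not shown.

```python
def linear_lr(step, base_lr, warmup_steps, total_steps):
    """Learning rate at `step` under linear warmup, then linear decay to zero.

    All arguments are illustrative assumptions; the model listing does not
    state the actual hyperparameters.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Hypothetical settings: base rate 1e-4, 500 warmup steps, 10,000 total.
    for step in (0, 250, 500, 5000, 10000):
        print(step, linear_lr(step, 1e-4, 500, 10_000))
```

Setting `warmup_steps` to 0 gives a pure linear decay from the base rate, which is the other usual reading of "linear scheduler".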
Model Capabilities
English Speech Recognition
Continuous Speech-to-Text
Large-Scale Speech Data Processing
Use Cases
Speech Transcription
Audiobook Transcription
Automatically transcribe English audiobooks into text
Reported word error rate (WER) is approximately 0.9977 (about 99.8 errors per 100 reference words)
Meeting Minutes
Automatically transcribe English meeting audio into text
Voice Assistants
Voice Command Recognition
Recognize English voice commands
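For readers interpreting the word error rate quoted under the transcription use case, the sketch below shows how WER is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the model output, divided by the number of reference words. The function name `word_error_rate` and the sample sentences are illustrative assumptions; this is not the evaluation script used for this model, and it applies no text normalization beyond whitespace splitting.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # Deleting i reference words to match an empty hypothesis.
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution in a three-word reference.
    print(word_error_rate("the cat sat", "the dog sat"))
```

Because insertions count as errors, WER can exceed 1.0; a value near 1.0 means the number of errors is roughly equal to the number of words in the reference.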