Wav2vec2 2 Roberta Large No Adapter Frozen Enc
Developed by speech-seq2seq
This model is a speech recognition model trained on the LibriSpeech ASR dataset, capable of converting speech to text.
Downloads 27
Release Time: 3/2/2022
Model Overview
This is an Automatic Speech Recognition (ASR) model for English speech-to-text tasks. It is trained on the LibriSpeech dataset, which consists of read English audiobook speech, and is intended for recognizing clear English speech.
Model Features
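Evaluation Result
Reports a Word Error Rate (WER) of 1.0008 on the LibriSpeech evaluation set. The unit is not stated; if the value is a fraction rather than a percentage, it corresponds to roughly 100% WER, which would indicate poor rather than high accuracy.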
Optimized Training
Trained using the Adam optimizer and a linear learning rate scheduler
Mixed Precision Training
Used native AMP (automatic mixed precision) to improve training efficiency
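To make the WER figure easier to interpret, here is a minimal sketch of how Word Error Rate is conventionally computed: the word-level edit distance between a reference and a hypothesis transcript, divided by the number of reference words. The function name `word_error_rate` is hypothetical, and this is not the evaluation script used for this model. It splits on whitespace only and applies no text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat", "the dog sat"))
    print(word_error_rate("a b", "x y z"))
```

Because insertions count as errors, WER expressed as a fraction can exceed 1.0 when the hypothesis contains more wrong or extra words than the reference has words, as in the second call above.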
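As an illustration of what a linear learning rate scheduler does, the sketch below computes the learning rate at a given training step, decaying linearly to zero with an optional linear warmup. The function name `linear_lr`, the warmup phase, and all numeric values are assumptions for illustration; the source does not state this model's base learning rate, step count, or whether warmup was used.

```python
def linear_lr(step: int, total_steps: int, base_lr: float, warmup_steps: int = 0) -> float:
    """Learning rate at `step` under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Warmup phase (assumed, optional): ramp linearly from 0 up to base_lr.
        return base_lr * step / warmup_steps
    # Decay phase: shrink linearly from base_lr to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Hypothetical settings: 100 steps, 10 of them warmup, base rate 1.0.
    for step in (0, 5, 10, 55, 100):
        print(step, linear_lr(step, total_steps=100, base_lr=1.0, warmup_steps=10))
```

In a real training loop this value would be applied to the optimizer on every step; here it is kept as a plain function so the shape of the schedule is easy to inspect.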
Model Capabilities
English Speech Recognition
Speech-to-Text
Use Cases
Speech Transcription
Audiobook Transcription
Convert English audiobooks into text format
Meeting Minutes
Convert English meeting recordings into written transcripts