Wav2vec Trained
A speech recognition model fine-tuned from facebook/wav2vec2-base. It achieves a word error rate (WER) of 0.1042 on the evaluation set.
Downloads 70
Release Time: 6/25/2022
Model Overview
A speech recognition model based on the wav2vec2 architecture, used to convert speech into text.
Model Features
Low Word Error Rate
Achieves a word error rate of 0.1042 on the evaluation set, which is roughly 10 word errors per 100 reference words.
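To make the metric concrete, here is a minimal sketch of how a word error rate is typically computed: the word-level edit distance (substitutions, insertions, deletions) divided by the number of reference words. The function name and the example sentences are illustrative assumptions. The evaluation script used for this model is not described in the source.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i] + [0] * len(hyp_words)
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref_words)


# Hypothetical transcripts: one substitution ("sat" -> "sit") and one deletion ("the").
example_wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER = {example_wer:.4f}")
```

On this scale, a value of 0.1042 corresponds to about 10.4 word-level errors per 100 reference words.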
Efficient Training
Trained with mixed-precision arithmetic (native AMP) to improve training efficiency.
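Mixed-precision training usually pairs half-precision arithmetic with loss scaling, because very small gradients underflow to zero in 16-bit floats. The sketch below illustrates only that underflow effect using the standard library. It is not the training code for this model, and the scale factor and gradient value are assumptions chosen for illustration.

```python
import struct


def to_half(value: float) -> float:
    """Round a Python float to IEEE 754 half precision and back."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


LOSS_SCALE = 65536.0   # illustrative scale factor (2**16), an assumption
tiny_gradient = 1e-8   # hypothetical gradient, below the half-precision range

# Stored directly in half precision, the gradient is lost.
unscaled = to_half(tiny_gradient)

# Scaling first keeps it representable; dividing afterwards recovers it.
scaled = to_half(tiny_gradient * LOSS_SCALE)
recovered = scaled / LOSS_SCALE

print(f"unscaled half value: {unscaled}")
print(f"recovered after scaling: {recovered:.3e}")
```

AMP implementations generally adjust the scale factor dynamically during training. The fixed value here only keeps the example short.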
Linear Learning Rate Scheduling
Trained with a linear learning rate scheduler with 1000 warm-up steps.
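A linear schedule with warm-up is commonly defined as a multiplier on the peak learning rate: it rises linearly from 0 to 1 during warm-up, then decays linearly to 0 by the last step. The sketch below assumes that definition. The 1000 warm-up steps come from the description above, while `total_steps` and `peak_lr` are hypothetical because the source does not state them.

```python
def linear_schedule_factor(step: int, warmup_steps: int = 1000,
                           total_steps: int = 10_000) -> float:
    """Multiplier applied to the peak learning rate at a given step.

    total_steps is a hypothetical value; the source gives only the warm-up length.
    """
    if step < warmup_steps:
        # Warm-up: ramp linearly from 0 up to 1.
        return step / max(1, warmup_steps)
    # Decay: fall linearly from 1 at the end of warm-up to 0 at total_steps.
    remaining = total_steps - step
    return max(0.0, remaining / max(1, total_steps - warmup_steps))


peak_lr = 1e-4  # hypothetical peak learning rate, not taken from the source
for step in (0, 500, 1000, 5500, 10_000):
    print(step, peak_lr * linear_schedule_factor(step))
```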
Model Capabilities
Speech-to-Text
Automatic Speech Recognition
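wav2vec2 models fine-tuned for speech recognition commonly use a CTC output layer, which predicts one token per audio frame. Assuming this model follows that pattern (the source does not confirm it), the last step from frame predictions to text can be sketched as greedy CTC decoding: merge consecutive repeats, then drop the blank token. The vocabulary and frame sequence below are hypothetical.

```python
from typing import Mapping, Sequence


def ctc_greedy_decode(frame_ids: Sequence[int],
                      id_to_char: Mapping[int, str],
                      blank_id: int = 0) -> str:
    """Collapse per-frame token ids into text using the CTC rule."""
    chars = []
    previous = None
    for token_id in frame_ids:
        # Emit a character only when the id changes and is not the blank.
        if token_id != previous and token_id != blank_id:
            chars.append(id_to_char[token_id])
        previous = token_id
    return "".join(chars)


# Hypothetical vocabulary and per-frame argmax output.
vocab = {0: "<blank>", 1: "h", 2: "e", 3: "l", 4: "o"}
frames = [1, 1, 0, 2, 2, 3, 0, 3, 3, 4, 0]
decoded = ctc_greedy_decode(frames, vocab)
print(decoded)
```

The blank between the two runs of `3` is what allows a doubled letter to survive the merge step.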
Use Cases
Speech Transcription
Automatic Meeting Minutes Generation
Automatically convert meeting recordings into written transcripts
Voice Memo Conversion
Convert voice memos into editable text