Wav2vec2 Base Toy Train Data Slow 10pct
W
Wav2vec2 Base Toy Train Data Slow 10pct
Developed by scasutt
A speech recognition model fine-tuned on an unknown dataset based on facebook/wav2vec2-base, with a Word Error Rate (WER) of 0.7175
Downloads 22
Release Time : 3/27/2022
Model Overview
This model is a fine-tuned version of wav2vec2-base, primarily used for speech recognition tasks. The model demonstrates certain recognition capabilities on the evaluation set but still has room for improvement.
Model Features
Fine-tuned based on wav2vec2-base
Fine-tuned on the base wav2vec2 model to adapt to specific speech recognition tasks
Linear Learning Rate Scheduling
Adopts a linear learning rate scheduling strategy with a 1000-step warm-up period
Gradient Accumulation Training
Uses gradient accumulation (steps=2) to increase effective batch size
Model Capabilities
Speech-to-Text
Automatic Speech Recognition
Use Cases
Speech Transcription
Meeting Minutes Transcription
Convert meeting recordings into text transcripts
Word Error Rate 0.7175
Voice Command Recognition
Recognize simple voice commands
Featured Recommended AI Models
Š 2025AIbase