Wav2vec2 Base Timit Demo Colab 32 Epochs30
W
Wav2vec2 Base Timit Demo Colab 32 Epochs30
Developed by ying-tina
A speech recognition model fine-tuned from facebook/wav2vec2-base, trained for 30 epochs on the TIMIT dataset
Downloads 22
Release Time : 3/2/2022
Model Overview
This model is a fine-tuned version of wav2vec2-base for English speech recognition, suitable for Automatic Speech Recognition (ASR) tasks
Model Features
Efficient Fine-tuning
Efficiently fine-tuned from the wav2vec2-base model with only 30 training epochs
Low Word Error Rate
Achieves a Word Error Rate (WER) of 0.3434 on the evaluation set
Lightweight
Based on the wav2vec2-base architecture, relatively lightweight and suitable for deployment
Model Capabilities
English speech recognition
Audio-to-text conversion
Use Cases
Speech Transcription
Meeting Minutes
Automatically transcribe English meeting recordings into text
Word Error Rate around 34.34%
Voice Notes
Convert English voice notes to text
Featured Recommended AI Models
Š 2025AIbase