Wav2vec2 Base Timit Demo Colab50
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base, trained for 30 epochs on the TIMIT dataset.
Downloads 16
Release Time : 5/1/2022
Model Overview
A speech recognition model based on the wav2vec2 architecture, suitable for English speech-to-text tasks.
Model Features
Based on wav2vec2 Architecture
Uses Facebook's open-source wav2vec2-base model as the foundational architecture
Fine-tuned on TIMIT Dataset
Fine-tuned for 30 epochs on the TIMIT speech dataset
Low Word Error Rate
Achieved a word error rate (WER) of 1.0 on the evaluation set
Model Capabilities
English Speech Recognition
Audio to Text Conversion
Use Cases
Speech Transcription
Speech to Text
Convert English speech content into text
Word Error Rate 1.0
Featured Recommended AI Models
Š 2025AIbase