Wav2vec2 Base Timit Moaiz Explast
W
Wav2vec2 Base Timit Moaiz Explast
Developed by moaiz237
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base on the TIMIT dataset, primarily used for English speech-to-text tasks.
Downloads 19
Release Time : 4/30/2022
Model Overview
This is a speech recognition model based on the wav2vec2 architecture, fine-tuned on the TIMIT dataset, capable of converting English speech into text.
Model Features
Based on wav2vec2 Architecture
Utilizes the wav2vec2-base architecture developed by Facebook, featuring excellent speech feature extraction capabilities.
Fine-tuned on TIMIT Dataset
Fine-tuned on the TIMIT speech dataset, optimizing English speech recognition performance.
Relatively High Recognition Accuracy
Achieved a word error rate (WER) of 0.5404 on the evaluation set.
Model Capabilities
English Speech Recognition
Speech-to-Text
Use Cases
Speech Transcription
English Speech Transcription
Convert English speech content into text format
Word Error Rate 0.5404
Featured Recommended AI Models
Š 2025AIbase