wav2vec2-base-timit-demo-colab50 Open-Source Speech Recognition Model

Wav2vec2 Base Timit Demo Colab50

Developed by hassnain

This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base, trained for 30 epochs on the TIMIT dataset.

Downloads 16

Release date: 5/1/2022

Model Overview

A speech recognition model based on the wav2vec2 architecture, intended for English speech-to-text tasks.

Based on wav2vec2 Architecture

Uses Facebook's open-source wav2vec2-base model as the foundational architecture

Fine-tuned on TIMIT Dataset

Fine-tuned for 30 epochs on the TIMIT speech dataset

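Word Error Rate

Reported a word error rate (WER) of 1.0 on the evaluation set. Lower WER is better, and 1.0 corresponds to a 100% error rate, which suggests this checkpoint does not yet produce usable transcripts.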
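To put the 1.0 figure in context, WER is conventionally computed as the word-level edit distance (substitutions, deletions, and insertions) divided by the number of words in the reference transcript. The sketch below is a minimal pure-Python version of that standard definition. It is not necessarily the exact metric implementation used to produce the numbers on this page, and the function name `word_error_rate` and the sample sentence are illustrative assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative helper (hypothetical name), not the evaluation code
    used for this model.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # consumed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Sample sentence chosen for illustration only.
reference = "she had your dark suit in greasy wash water all year"

print(word_error_rate(reference, reference))  # perfect transcript: 0.0
print(word_error_rate(reference, ""))         # empty transcript: 1.0
```

Under this definition, a system that outputs nothing at all already scores exactly 1.0, so a WER of 1.0 is the level of a model that has not learned to transcribe.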

English Speech Recognition

Audio to Text Conversion

Speech Transcription

Speech to Text

Convert English speech content into text

Word Error Rate 1.0

Training Loss	Epoch	Step	Validation Loss	WER
5.4568	7.04	500	3.3002	1.0
3.1795	14.08	1000	3.2170	1.0
3.1607	21.13	1500	3.2119	1.0
3.1537	28.17	2000	3.2257	1.0
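The log rows can be read with a little arithmetic. The sketch below assumes that the Step column counts optimizer steps and that the Epoch column is the fractional number of epochs completed at that step, which the page does not state explicitly. Under that assumption it estimates the number of steps per epoch and checks how much the validation loss moved between the first and last logged evaluation.

```python
# Rows copied from the training log:
# (training loss, epoch, step, validation loss, WER)
log = [
    (5.4568, 7.04, 500, 3.3002, 1.0),
    (3.1795, 14.08, 1000, 3.2170, 1.0),
    (3.1607, 21.13, 1500, 3.2119, 1.0),
    (3.1537, 28.17, 2000, 3.2257, 1.0),
]

# Steps per epoch implied by each row (assumes Step = optimizer steps).
estimates = [step / epoch for _, epoch, step, _, _ in log]
avg_steps_per_epoch = sum(estimates) / len(estimates)

# Rough total number of steps for the stated 30 epochs.
total_steps_30_epochs = avg_steps_per_epoch * 30

# Change in validation loss from the first to the last evaluation.
val_loss_drop = log[0][3] - log[-1][3]

print(round(avg_steps_per_epoch))    # 71
print(round(total_steps_30_epochs))  # 2130
print(round(val_loss_drop, 4))       # 0.0745
```

The validation loss changes by less than 0.1 across roughly 21 epochs while WER stays at 1.0, which is consistent with a training run that plateaued early rather than one that was still improving.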
