Wav2Vec2-base_toy_train_data_augmented Open-source Speech Recognition Model - Optimizing Training Data for Precise Speech Recognition

Home

Wav2vec2 Base Toy Train Data Augmented

Developed by scasutt

A fine-tuned speech recognition model based on facebook/wav2vec2-base, optimized with augmented training data.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech Recognition #Fine-tuned Model #Low Word Error Rate

Downloads 22

Release Time : 3/26/2022

Model Overview

This model is a speech recognition model based on the wav2vec2 architecture, improved in recognition accuracy through fine-tuning on specific datasets.

Model Features

Data Augmentation Training

The model employs data augmentation techniques during training to enhance generalization capabilities.

Low Word Error Rate

After fine-tuning, the model achieves a low word error rate (WER) on the validation set.

Model Capabilities

Speech Recognition

Audio to Text

Use Cases

Speech Transcription

Meeting Minutes Transcription

Automatically transcribe meeting recordings into text for easy documentation and retrieval.

Voice Assistant

Used in the speech recognition module of voice assistants to improve recognition accuracy.

Training Loss	Epoch	Step	Validation Loss	Wer
3.12	1.05	250	3.3998	0.9982
3.0727	2.1	500	3.1261	0.9982
1.9729	3.15	750	1.4868	0.9464
1.3213	4.2	1000	1.2598	0.8833
1.0508	5.25	1250	1.0014	0.8102
0.8483	6.3	1500	0.9475	0.7944
0.7192	7.35	1750	0.9493	0.7686
0.6447	8.4	2000	0.9872	0.7573
0.6064	9.45	2250	0.9587	0.7447
0.5384	10.5	2500	0.9332	0.7320
0.4985	11.55	2750	0.9926	0.7315
0.4643	12.6	3000	1.0008	0.7292
0.4565	13.65	3250	0.9522	0.7171
0.449	14.7	3500	0.9685	0.7140
0.4307	15.75	3750	1.0080	0.7077
0.4239	16.81	4000	0.9950	0.7023
0.389	17.86	4250	1.0260	0.7007
0.3471	18.91	4500	1.0012	0.6966
0.3276	19.96	4750	1.0238	0.6969

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base Toy Train Data Augmented

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-base_toy_train_data_augmented

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License