wav2vec2-base-checkpoint-9: Open-Source Speech Recognition Model

Wav2vec2 Base Checkpoint 9

Developed by jiobiala24

This speech recognition model is a fine-tuned version of wav2vec2-base-checkpoint-8, trained on the common_voice dataset. It achieves a word error rate (WER) of 0.3258 on the evaluation set.

Speech Recognition

Transformers

Open Source License: Apache-2.0

#Speech Recognition #Fine-tuned Model #Low Word Error Rate

Downloads 16

Release Time: 3/2/2022

Model Overview

This is a speech recognition model based on the wav2vec2 architecture, fine-tuned on the common_voice dataset, capable of converting speech to text.
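As a rough illustration of the speech-to-text use described above, the following sketch loads the model through the Transformers `pipeline` API. The Hub identifier `jiobiala24/wav2vec2-base-checkpoint-9` is an assumption assembled from the developer and model names on this page, and `transcribe` is a hypothetical helper name, not part of the model's documentation.

```python
# Assumed Hub id, built from the developer name and model name listed on this page.
MODEL_ID = "jiobiala24/wav2vec2-base-checkpoint-9"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file and return the recognized text.

    Hypothetical helper for illustration. Requires the `transformers` and
    `torch` packages, plus ffmpeg so the pipeline can decode the audio file.
    """
    # Imported lazily so the module can be loaded without the dependency.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict containing a "text" key
    return result["text"]


if __name__ == "__main__":
    import sys

    # Usage: python transcribe.py path/to/audio.wav
    if len(sys.argv) > 1:
        print(transcribe(sys.argv[1]))
```

The first call downloads the model weights, so the helper is best invoked once and reused for batches of files.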

Model Features

Low Word Error Rate

Achieves a word error rate (WER) of 0.3258 on the evaluation set, which is roughly one error for every three reference words.
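For readers unfamiliar with the metric, the sketch below shows one common way to compute word error rate: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is illustrative, and this is not necessarily the exact implementation used to produce the reported 0.3258.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the processed reference prefix and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words to reach an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
print(round(word_error_rate("the cat sat on the mat", "the cat sit on mat"), 4))  # 0.3333
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.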

Based on wav2vec2 Architecture

Uses the wav2vec2 architecture, which learns speech representations directly from raw audio waveforms.

Fine-tuned on common_voice Dataset

Fine-tuned on the common_voice dataset, starting from wav2vec2-base-checkpoint-8.

Model Capabilities

Speech Recognition

Automatic Speech-to-Text

Use Cases

Speech Transcription

Voice Memo Transcription

Automatically converts voice memos into text

Approximately 67.42% word accuracy (estimated as 1 - WER, with WER = 0.3258)

Accessibility Applications

Real-time Caption Generation

Provides real-time captions for people who are deaf or hard of hearing

Training Results

Training Loss	Epoch	Step	Validation Loss	WER
0.2783	1.58	1000	0.5610	0.3359
0.2251	3.16	2000	0.5941	0.3374
0.173	4.74	3000	0.6026	0.3472
0.1475	6.32	4000	0.6750	0.3482
0.1246	7.9	5000	0.6673	0.3414
0.1081	9.48	6000	0.7072	0.3409
0.1006	11.06	7000	0.7413	0.3392
0.0879	12.64	8000	0.7831	0.3394
0.0821	14.22	9000	0.7371	0.3333
0.0751	15.8	10000	0.8321	0.3445
0.0671	17.38	11000	0.8362	0.3357
0.0646	18.96	12000	0.8709	0.3367
0.0595	20.54	13000	0.8352	0.3321
0.0564	22.12	14000	0.8854	0.3323
0.052	23.7	15000	0.9031	0.3315
0.0485	25.28	16000	0.9171	0.3278
0.046	26.86	17000	0.9390	0.3254
0.0438	28.44	18000	0.9203	0.3258
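The table above can be inspected programmatically. The sketch below copies its rows into a small Python structure and looks up the step with the lowest WER, the step with the lowest validation loss, and the final row. The `Row` type and variable names are illustrative only.

```python
from typing import NamedTuple


class Row(NamedTuple):
    train_loss: float
    epoch: float
    step: int
    val_loss: float
    wer: float


# Values copied from the training results table above.
ROWS = [
    Row(0.2783, 1.58, 1000, 0.5610, 0.3359),
    Row(0.2251, 3.16, 2000, 0.5941, 0.3374),
    Row(0.1730, 4.74, 3000, 0.6026, 0.3472),
    Row(0.1475, 6.32, 4000, 0.6750, 0.3482),
    Row(0.1246, 7.90, 5000, 0.6673, 0.3414),
    Row(0.1081, 9.48, 6000, 0.7072, 0.3409),
    Row(0.1006, 11.06, 7000, 0.7413, 0.3392),
    Row(0.0879, 12.64, 8000, 0.7831, 0.3394),
    Row(0.0821, 14.22, 9000, 0.7371, 0.3333),
    Row(0.0751, 15.80, 10000, 0.8321, 0.3445),
    Row(0.0671, 17.38, 11000, 0.8362, 0.3357),
    Row(0.0646, 18.96, 12000, 0.8709, 0.3367),
    Row(0.0595, 20.54, 13000, 0.8352, 0.3321),
    Row(0.0564, 22.12, 14000, 0.8854, 0.3323),
    Row(0.0520, 23.70, 15000, 0.9031, 0.3315),
    Row(0.0485, 25.28, 16000, 0.9171, 0.3278),
    Row(0.0460, 26.86, 17000, 0.9390, 0.3254),
    Row(0.0438, 28.44, 18000, 0.9203, 0.3258),
]

best_wer = min(ROWS, key=lambda r: r.wer)
best_val = min(ROWS, key=lambda r: r.val_loss)
final = ROWS[-1]

print(f"Lowest WER:             {best_wer.wer:.4f} at step {best_wer.step}")
print(f"Lowest validation loss: {best_val.val_loss:.4f} at step {best_val.step}")
print(f"Final row:              WER {final.wer:.4f} at step {final.step}")
```

In these figures, training loss falls steadily while validation loss rises from its first-row minimum, and the lowest WER appears one logging step before the final row.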

