Wav2vec2-base-cv-10000 Open-source Speech Recognition Model - Precise Recognition to Boost Speech Processing

Wav2vec2 Base Cv 10000

Developed by jiobiala24

A speech recognition model fine-tuned on the Common Voice dataset based on wav2vec2-base-cv, achieving a word error rate of 36.84% on the evaluation set.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech Recognition #Low Word Error Rate #Common Voice Dataset

Downloads 28

Release Time : 3/8/2022

Model Overview

This model is a speech recognition model based on the wav2vec2 architecture, fine-tuned on the Common Voice dataset, suitable for speech-to-text tasks.

Model Features

Low Word Error Rate

Achieved a word error rate of 36.84% on the evaluation set, demonstrating good performance.

Based on wav2vec2 Architecture

Utilizes the wav2vec2-base architecture, which has excellent speech feature extraction capabilities.

Fine-tuning Optimization

Fine-tuned for 30 epochs on the Common Voice dataset, optimizing model performance.

Model Capabilities

Speech Recognition

Speech-to-Text

Use Cases

Speech Transcription

Meeting Minutes

Convert meeting speech into real-time text records

Accuracy approximately 63.16% (based on a 36.84% word error rate)

Voice Notes

Convert voice notes into editable text

Assistive Technology

Voice Control

Provide text conversion functionality for voice control applications

Training Loss	Epoch	Step	Validation Loss	Wer
0.4243	1.6	1000	0.7742	0.4210
0.3636	3.2	2000	0.8621	0.4229
0.2638	4.8	3000	0.9328	0.4094
0.2273	6.4	4000	0.9556	0.4087
0.187	8.0	5000	0.9093	0.4019
0.1593	9.6	6000	0.9842	0.4029
0.1362	11.2	7000	1.0651	0.4077
0.1125	12.8	8000	1.0550	0.3959
0.103	14.4	9000	1.1919	0.4002
0.0948	16.0	10000	1.1901	0.3983
0.0791	17.6	11000	1.1091	0.3860
0.0703	19.2	12000	1.2823	0.3904
0.0641	20.8	13000	1.2625	0.3817
0.057	22.4	14000	1.2821	0.3776
0.0546	24.0	15000	1.2975	0.3770
0.0457	25.6	16000	1.2998	0.3714
0.0433	27.2	17000	1.3574	0.3721
0.0423	28.8	18000	1.3393	0.3684

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base Cv 10000

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-base-cv-10000

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License