The open-source speech recognition model wav2vec2_common_voice_accents_3

Home

Wav2vec2 Common Voice Accents 3

Developed by willcai

A speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Multi-accent speech recognition #Fine-tuned XLS-R architecture #Low-loss speech model

Downloads 16

Release Time : 3/16/2022

Model Overview

This is a model optimized for multi-accent speech recognition, fine-tuned on the wav2vec2-xls-r-300m architecture, suitable for general speech recognition tasks

Model Features

Multi-accent support

Fine-tuned on the Common Voice dataset, capable of recognizing speech with various accents

Efficient training

Uses mixed-precision training and distributed training techniques to improve training efficiency

Low validation loss

After 30 training epochs, the validation loss dropped to 0.0042, demonstrating excellent performance

Model Capabilities

Speech recognition

Multi-accent speech processing

Audio feature extraction

Use Cases

Speech-to-text

Meeting transcription

Automatically convert meeting recordings into text transcripts

Highly accurate text transcription

Voice assistant

Serves as the foundational recognition engine for voice assistants

Supports user input with various accents

Speech analysis

Accent recognition

Identify and analyze different accent features in speech

Can be used for linguistic research or market analysis

Training Loss	Epoch	Step	Validation Loss
4.584	1.27	400	1.1439
0.481	2.55	800	0.1986
0.2384	3.82	1200	0.1060
0.1872	5.1	1600	0.1016
0.158	6.37	2000	0.0942
0.1427	7.64	2400	0.0646
0.1306	8.92	2800	0.0612
0.1197	10.19	3200	0.0423
0.1129	11.46	3600	0.0381
0.1054	12.74	4000	0.0326
0.0964	14.01	4400	0.0293
0.0871	15.29	4800	0.0239
0.0816	16.56	5200	0.0168
0.0763	17.83	5600	0.0202
0.0704	19.11	6000	0.0224
0.0669	20.38	6400	0.0208
0.063	21.66	6800	0.0074
0.0585	22.93	7200	0.0126
0.0548	24.2	7600	0.0086
0.0512	25.48	8000	0.0080
0.0487	26.75	8400	0.0052
0.0455	28.03	8800	0.0062
0.0433	29.3	9200	0.0042

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Common Voice Accents 3

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2_common_voice_accents_3

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License