wav2vec2-large-xls-r-300m-pt-colab Open-source Speech Recognition Model - Accurately Identify Speech Content for Free

Home

Wav2vec2 Large Xls R 300m Pt Colab

Developed by tonyalves

A speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech Recognition #Multilingual Support #Low Word Error Rate

Downloads 17

Release Time : 3/2/2022

Model Overview

This model is a pre-trained model for speech recognition tasks, capable of converting speech to text after fine-tuning.

Model Features

Efficient Speech Recognition

Based on the wav2vec2 architecture, it can efficiently and accurately convert speech to text

Large-scale Pretraining

A large-scale pre-trained model with 300 million parameters, featuring powerful feature extraction capabilities

Fine-tuning Optimization

Fine-tuned on the common_voice dataset, optimizing recognition performance

Model Capabilities

Speech Recognition

Audio-to-Text Conversion

Automatic Speech Transcription

Use Cases

Speech Transcription

Meeting Minutes

Automatically convert meeting recordings into text transcripts

Word error rate around 30%

Subtitle Generation

Automatically generate subtitles for video content

Voice Assistants

Voice Command Recognition

Recognize user voice commands

Training Loss	Epoch	Step	Validation Loss	Wer
4.591	1.15	400	0.9128	0.6517
0.5049	2.31	800	0.4596	0.4437
0.2871	3.46	1200	0.3964	0.3905
0.2077	4.61	1600	0.3958	0.3744
0.1695	5.76	2000	0.4040	0.3720
0.1478	6.92	2400	0.3866	0.3651
0.1282	8.07	2800	0.3987	0.3674
0.1134	9.22	3200	0.4128	0.3688
0.1048	10.37	3600	0.3928	0.3561
0.0938	11.53	4000	0.4048	0.3619
0.0848	12.68	4400	0.4229	0.3555
0.0798	13.83	4800	0.3974	0.3468
0.0688	14.98	5200	0.3870	0.3503
0.0658	16.14	5600	0.3875	0.3351
0.061	17.29	6000	0.4133	0.3417
0.0569	18.44	6400	0.3915	0.3414
0.0526	19.6	6800	0.3957	0.3231
0.0468	20.75	7200	0.4110	0.3301
0.0407	21.9	7600	0.3866	0.3186
0.0384	23.05	8000	0.3976	0.3193
0.0363	24.21	8400	0.3910	0.3177
0.0313	25.36	8800	0.3656	0.3109
0.0293	26.51	9200	0.3712	0.3092
0.0277	27.66	9600	0.3613	0.3054
0.0249	28.82	10000	0.3783	0.3015
0.0234	29.97	10400	0.3637	0.2982

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Xls R 300m Pt Colab

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-large-xls-r-300m-pt-colab

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License