wav2vec2-large-xlsr-53-demo-colab Open Source Speech Recognition Model - Precise and Robust Speech Event Recognition

Wav2vec2 Large Xlsr 53 Demo Colab

Developed by emre

This model is a speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-large-xlsr-53, primarily used for robust speech event recognition.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Multilingual speech recognition #Robust speech processing #XLSR fine-tuning

Downloads 16

Release Time : 3/2/2022

Model Overview

This is a speech recognition model based on the wav2vec2 architecture, fine-tuned for the common_voice dataset, capable of converting speech to text.

Model Features

Based on wav2vec2 architecture

Uses facebook's wav2vec2-large-xlsr-53 as the base model, featuring powerful speech feature extraction capabilities.

Fine-tuned on Common Voice dataset

Fine-tuned on the Common Voice dataset, enhancing the model's robustness and adaptability.

Relatively low word error rate

Achieved a word error rate (WER) of 0.4834 on the evaluation set, demonstrating good performance.

Model Capabilities

Speech recognition

Speech-to-text

Robust speech event detection

Use Cases

Speech transcription

Automatically convert speech content into text format

Word error rate 0.4834

Voice assistant

Voice command recognition

Recognize user voice commands and convert them into executable commands

Training Loss	Epoch	Step	Validation Loss	Wer
5.1516	4.21	400	2.7673	1.0
0.9134	8.42	800	0.4618	0.6418
0.3273	12.63	1200	0.4188	0.5535
0.2252	16.84	1600	0.4144	0.5232
0.1692	21.05	2000	0.3995	0.5030
0.1355	25.26	2400	0.4073	0.4920
0.1172	29.47	2800	0.3966	0.4834

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Xlsr 53 Demo Colab

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-large-xlsr-53-demo-colab

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License