AI Light Dance Singing FT Wav2Vec2-large-xlsr-53-5gram-v1 Open Source Model

Ai Light Dance Singing Ft Wav2vec2 Large Xlsr 53 5gram V1

Developed by gary109

This model is an automatic speech recognition model based on wav2vec2-large-xlsr-53, fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING dataset, primarily used for singing voice recognition.

Speech Recognition

Transformers

#Singing Voice Recognition #XLSR-53 Fine-tuning #Low Word Error Rate

Downloads 18

Release Time : 6/18/2022

Model Overview

This is an automatic speech recognition model for singing voice recognition, fine-tuned based on the wav2vec2-large-xlsr-53 architecture, and performs exceptionally well on specific singing datasets.

Model Features

High-Accuracy Singing Recognition

Fine-tuned on the ONSET-SINGING dataset, specifically optimized for singing voice recognition

Based on wav2vec2 Architecture

Utilizes the powerful wav2vec2-large-xlsr-53 as the base model

Low Word Error Rate

Achieves a word error rate of 16.68% on the evaluation set

Model Capabilities

Singing Voice Recognition

Automatic Speech Transcription

Use Cases

Music Technology

Singing Content Transcription

Automatically converts singing recordings into text

Word error rate 16.68%

Music Education Assistance

Helps music learners analyze singing pronunciation

Training Loss	Epoch	Step	Validation Loss	Wer
0.2696	1.0	552	0.4421	0.2013
0.2498	2.0	1104	0.4389	0.1887
0.2387	3.0	1656	0.4154	0.1788
0.1902	4.0	2208	0.4143	0.1753
0.1896	5.0	2760	0.4123	0.1668
0.1658	6.0	3312	0.4366	0.1651
0.1312	7.0	3864	0.4309	0.1594
0.1186	8.0	4416	0.4432	0.1561
0.1476	9.0	4968	0.4400	0.1569
0.1027	10.0	5520	0.4389	0.1554

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Ai Light Dance Singing Ft Wav2vec2 Large Xlsr 53 5gram V1

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 ai-light-dance_singing_ft_wav2vec2-large-xlsr-53-5gram-v1

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions