AI Light Dance Singing2 ft Wav2Vec2 Open-Source Model - Accurately Identify Singing Voices, Free Deployment and Easy to Use!

Ai Light Dance Singing2 Ft Wav2vec2 Large Xlsr 53 5gram V4 1

Developed by gary109

This model is an automatic speech recognition (ASR) model based on the wav2vec2-large-xlsr-53 architecture, fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING2 dataset, primarily used for singing voice recognition tasks.

Speech Recognition

Transformers

#Singing voice recognition #High-precision speech transcription #Music content analysis

Downloads 66

Release Time : 6/28/2022

Model Overview

This is an automatic speech recognition model specifically optimized for singing voice, based on the wav2vec2-large-xlsr-53 architecture and fine-tuned on a specific singing dataset, capable of accurately recognizing singing content.

Model Features

Singing voice optimization

Specifically optimized for singing content, performing better in singing scenarios compared to general speech recognition models

High accuracy

Achieved a word error rate (WER) of 12.11% on the evaluation set, demonstrating good performance

Based on wav2vec2 architecture

Utilizes the powerful wav2vec2-large-xlsr-53 as the base model, featuring excellent speech feature extraction capabilities

Model Capabilities

Singing voice recognition

Automatic speech-to-text

Music content analysis

Use Cases

Music technology

Singing content transcription

Automatically convert singing recordings into text lyrics

Word error rate 12.11%

Music content analysis

Analyze singing content for music information retrieval

Training Loss	Epoch	Step	Validation Loss	Wer
0.2609	1.0	280	0.2313	0.1376
0.2297	2.0	560	0.2240	0.1397
0.1951	3.0	840	0.2280	0.1361
0.1816	4.0	1120	0.2215	0.1282
0.1634	5.0	1400	0.2180	0.1240
0.1338	6.0	1680	0.2226	0.1241
0.1411	7.0	1960	0.2143	0.1211
0.1143	8.0	2240	0.2181	0.1174
0.1127	9.0	2520	0.2215	0.1167
0.105	10.0	2800	0.2196	0.1160

Property	Details
Model Type	Fine - tuned version of gary109/ai-light-dance_singing_ft_wav2vec2-large-xlsr-53-5gram-v4
Training Data	GARY109/AI_LIGHT_DANCE - ONSET - SINGING2 dataset

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Ai Light Dance Singing2 Ft Wav2vec2 Large Xlsr 53 5gram V4 1

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53-5gram-v4-1

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions