AI Light Dance Singing FT Wav2Vec2-Large-XLSR-53 Open-Source Model

ai-light-dance_singing_ft_wav2vec2-large-xlsr-53

Developed by gary109

An automatic speech recognition model obtained by fine-tuning facebook/wav2vec2-large-xlsr-53 on the AI_LIGHT_DANCE - ONSET-SINGING dataset. It is primarily intended for singing voice recognition.

Speech Recognition

Transformers

Open Source License: Apache-2.0 #Singing voice recognition #Low word error rate #XLSR-53 fine-tuning

Downloads 23

Release Time: 6/15/2022

Model Overview

An automatic speech recognition model for singing voice, fine-tuned from the wav2vec2-large-xlsr-53 checkpoint. It reaches a word error rate (WER) of 20.43% on the evaluation set.
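A minimal sketch of how a recording might be transcribed with the Transformers library. The Hub identifier below is an assumption pieced together from the developer name and model name on this page, and `transcribe` is a hypothetical helper, not part of the model's own documentation.

```python
# Assumed Hub identifier (developer name + model name); verify before use.
MODEL_ID = "gary109/ai-light-dance_singing_ft_wav2vec2-large-xlsr-53"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the model's transcription of one audio file (hypothetical helper)."""
    # Imported lazily so this module loads even where transformers is absent.
    from transformers import pipeline

    # Downloads the checkpoint on first use. Decoding a file path requires
    # ffmpeg; the pipeline resamples to the rate the model expects.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)
    return result["text"]
```

Calling `transcribe("take_01.wav")` (a placeholder file name) would return the recognized text as a string. For long recordings, the pipeline's `chunk_length_s` argument may be worth trying, though this page gives no guidance on suitable values.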

Model Features

Optimized for Singing Voice Recognition

Fine-tuned specifically on singing voice, so it is intended to handle sung audio better than general-purpose speech recognition models. No comparison figures are given on this page.

Low Word Error Rate

Achieves a word error rate (WER) of 20.43% on the evaluation set, that is, roughly one word-level error for every five reference words.
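For readers unfamiliar with the metric, here is a small sketch of how word error rate is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. This is the standard definition, not necessarily the exact script used to produce the figure above, and the example phrases are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substitution and one deletion against five reference words.
print(word_error_rate("a b c d e", "a x c d"))  # 0.4
```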

Based on XLSR Architecture

Builds on XLSR-53, a large wav2vec 2.0 model pre-trained on speech from 53 languages for cross-lingual speech representation learning.

Model Capabilities

Singing voice recognition

Audio-to-text conversion

Music content analysis

Use Cases

Music Analysis

Singing Lyrics Transcription

Automatically converts singing recordings into lyric text

Word error rate: 20.43% on the evaluation set

Music Content Retrieval

Searches for music segments via lyric content
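One way such retrieval could work, sketched under the assumption that recordings have already been transcribed into timestamped segments. The `Segment` type, the `search` function, and the sample lyrics are all hypothetical; the page does not describe an actual retrieval implementation.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds
    end: float    # seconds
    text: str     # transcribed lyrics for this span


def normalize(text: str) -> list[str]:
    """Lowercase and strip surrounding punctuation from each word."""
    words = (w.strip(".,!?;:\"'").lower() for w in text.split())
    return [w for w in words if w]


def search(segments: list[Segment], query: str) -> list[Segment]:
    """Return segments whose lyrics contain the query as a contiguous phrase."""
    q = normalize(query)
    if not q:
        return []
    hits = []
    for seg in segments:
        words = normalize(seg.text)
        last_start = len(words) - len(q)
        if any(words[i:i + len(q)] == q for i in range(last_start + 1)):
            hits.append(seg)
    return hits


segments = [
    Segment(0.0, 4.5, "Shine on the river tonight"),
    Segment(4.5, 9.0, "Carry me home, carry me far"),
]
for hit in search(segments, "carry me home"):
    print(f"{hit.start:.1f}-{hit.end:.1f}s: {hit.text}")
```

Exact phrase matching is the simplest choice. With a WER around 20%, a fuzzy match that tolerates a few word errors would likely recall more segments.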

Music Education

Singing Practice Evaluation

Compares the transcription of a singing recording against the reference lyrics
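A rough sketch of how a transcription might be compared with reference lyrics, using Python's standard `difflib`. The function name, the coverage score, and the sample lyrics are assumptions for illustration; the page does not specify how evaluation is performed.

```python
from difflib import SequenceMatcher


def compare_to_lyrics(reference: str, transcript: str):
    """Align transcript words to reference words and report the differences.

    Returns (coverage, diffs): coverage is the fraction of reference words
    matched in order; diffs lists each non-matching span as
    (operation, reference_words, transcript_words).
    """
    ref = reference.lower().split()
    hyp = transcript.lower().split()
    matcher = SequenceMatcher(a=ref, b=hyp, autojunk=False)

    matched = sum(block.size for block in matcher.get_matching_blocks())
    coverage = matched / len(ref) if ref else 0.0

    diffs = [
        (tag, ref[i1:i2], hyp[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]
    return coverage, diffs


coverage, diffs = compare_to_lyrics(
    "row the boat down the stream",
    "row the boat down a stream",
)
print(f"coverage: {coverage:.2f}")
for tag, expected, heard in diffs:
    print(tag, expected, "->", heard)
```

A mismatch may come from the singer or from the recognizer, so a score like this is best treated as a hint rather than a grade.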

Training Loss	Epoch	Step	Validation Loss	WER
1.4089	1.0	552	1.4750	0.9054
0.7995	2.0	1104	0.9044	0.6163
0.6232	3.0	1656	0.6645	0.3980
0.5351	4.0	2208	0.5674	0.3120
0.472	5.0	2760	0.5167	0.2579
0.3913	6.0	3312	0.4553	0.2335
0.3306	7.0	3864	0.4476	0.2114
0.3028	8.0	4416	0.4327	0.2043
0.317	9.0	4968	0.4355	0.2033
0.2494	10.0	5520	0.4405	0.2022
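To make the table easier to read, the short sketch below loads the rows above and picks out the epoch with the lowest validation loss and the epoch with the lowest WER. The `EpochResult` type is introduced here only for illustration.

```python
from typing import NamedTuple


class EpochResult(NamedTuple):
    train_loss: float
    epoch: int
    step: int
    val_loss: float
    wer: float


# Values copied from the training results table.
RESULTS = [
    EpochResult(1.4089, 1, 552, 1.4750, 0.9054),
    EpochResult(0.7995, 2, 1104, 0.9044, 0.6163),
    EpochResult(0.6232, 3, 1656, 0.6645, 0.3980),
    EpochResult(0.5351, 4, 2208, 0.5674, 0.3120),
    EpochResult(0.472, 5, 2760, 0.5167, 0.2579),
    EpochResult(0.3913, 6, 3312, 0.4553, 0.2335),
    EpochResult(0.3306, 7, 3864, 0.4476, 0.2114),
    EpochResult(0.3028, 8, 4416, 0.4327, 0.2043),
    EpochResult(0.317, 9, 4968, 0.4355, 0.2033),
    EpochResult(0.2494, 10, 5520, 0.4405, 0.2022),
]

best_by_loss = min(RESULTS, key=lambda r: r.val_loss)
best_by_wer = min(RESULTS, key=lambda r: r.wer)

print(f"lowest validation loss: epoch {best_by_loss.epoch}, WER {best_by_loss.wer:.2%}")
print(f"lowest WER: epoch {best_by_wer.epoch}, WER {best_by_wer.wer:.2%}")
```

The 20.43% figure quoted on this page matches epoch 8, the row with the lowest validation loss. WER keeps edging down to 20.22% by epoch 10 while validation loss rises slightly, which suggests (though the page does not say so) that the reported checkpoint was selected by validation loss.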
