ai-light-dance_singing_ft_wav2vec2-large-lv60-v2 Open Source Model

Ai Light Dance Singing Ft Wav2vec2 Large Lv60 V2

Developed by gary109

This is an automatic speech recognition model for singing voice. It is based on wav2vec2-large-lv60 and fine-tuned on the ONSET-SINGING dataset.

Speech Recognition

Transformers

Open Source License: Apache-2.0 #Singing Voice Recognition #High-precision WER #Fine-tuned wav2vec2

Downloads 16

Release Time: 5/18/2022

Model Overview

This is an automatic speech recognition model optimized for singing voice recognition. It reports a Word Error Rate (WER) of 0.1858 on the evaluation set.

Model Features

Singing Voice Optimization

Fine-tuned specifically on singing voice (the ONSET-SINGING dataset), with the aim of outperforming general speech recognition models on singing voice recognition tasks.

Low Word Error Rate

Achieves a Word Error Rate (WER) of 0.1858 on the evaluation set, the value logged at epoch 4 together with the lowest validation loss (0.4285).

Based on wav2vec2 Architecture

Uses Facebook's wav2vec2-large-lv60 as the base model for speech feature extraction. This version is a further fine-tune of gary109/ai-light-dance_singing_ft_wav2vec2-large-lv60.
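
For readers unfamiliar with the metric, the word error rate quoted above is the word-level edit distance between the reference lyrics and the recognized text, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the evaluation script used for this model; the function name and the example strings are illustrative only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words consumed so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution and one deletion over five reference words.
print(word_error_rate("sing along with the band", "sing alone with band"))
```

Under this definition, a WER of 0.1858 corresponds to roughly 19 word errors per 100 reference words.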

Model Capabilities

Singing Voice Recognition

Automatic Speech-to-Text

Music Content Analysis

Use Cases

Music Technology

Singing Voice to Lyrics

Automatically convert singing recordings into lyric text

Word Error Rate 0.1858

Music Content Analysis

Analyze lyric content in songs
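
The singing-to-lyrics use case above could be tried with the Transformers `pipeline` API. This is a hedged sketch rather than the author's documented usage: the Hub model ID is an assumption pieced together from the developer and model names on this page, and `transcribe` is a hypothetical helper name.

```python
import sys

# Assumed Hub ID, pieced together from the developer and model names on this page.
MODEL_ID = "gary109/ai-light-dance_singing_ft_wav2vec2-large-lv60-v2"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the recognized text for one audio file."""
    # Imported here so the module loads even where transformers is not installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # Decoding a file path requires ffmpeg to be available on the system.
    return asr(audio_path)["text"]


if __name__ == "__main__" and len(sys.argv) > 1:
    print(transcribe(sys.argv[1]))
```

Run it with the path to a singing recording as the only argument. The first call downloads the model weights, so it needs network access and some disk space.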

Entertainment Applications

Karaoke Lyric Synchronization

Real-time recognition of singing content and synchronized lyric display
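
Synchronized display needs a time for each recognized unit, not only the text. One possible way to get it is sketched below, under several assumptions this page does not confirm: that the model has a CTC output head, that audio is sampled at 16 kHz, and that the encoder emits one frame per 320 samples (20 ms), as is usual for wav2vec2. The vocabulary and frame IDs are made up for illustration.

```python
# Assumed: one output frame per 320 input samples at 16 kHz, i.e. 20 ms per frame.
FRAME_SECONDS = 320 / 16_000


def ctc_tokens_with_times(frame_ids, id_to_token, blank_id=0):
    """Greedy CTC decoding: collapse repeats, drop blanks, keep each token's start time."""
    timed = []
    previous = blank_id
    for frame_index, token_id in enumerate(frame_ids):
        # A token starts where a non-blank ID differs from the previous frame's ID.
        if token_id != blank_id and token_id != previous:
            start = round(frame_index * FRAME_SECONDS, 3)
            timed.append((id_to_token[token_id], start))
        previous = token_id
    return timed


# Hypothetical vocabulary and per-frame argmax IDs (0 is the CTC blank).
vocab = {0: "<pad>", 1: "l", 2: "a"}
frames = [0, 1, 1, 0, 2, 2, 2, 0, 1, 2]
print(ctc_tokens_with_times(frames, vocab))
```

For these frames the sketch yields `[('l', 0.02), ('a', 0.08), ('l', 0.16), ('a', 0.18)]`. A karaoke display would then group characters into words and highlight each word at its start time. Doing this in real time would additionally require feeding audio in short chunks, which is not shown here.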

Training Loss	Epoch	Step	Validation Loss	WER
0.2775	1.0	1106	0.4372	0.2117
0.2154	2.0	2212	0.4474	0.2044
0.2023	3.0	3318	0.4372	0.1920
0.186	4.0	4424	0.4285	0.1858
0.1856	5.0	5530	0.4589	0.1826
0.1537	6.0	6636	0.4658	0.1774
0.1337	7.0	7742	0.4769	0.1744
0.108	8.0	8848	0.4604	0.1724
0.1593	9.0	9954	0.4731	0.1694
0.0904	10.0	11060	0.4843	0.1683
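
The table shows validation loss and WER moving in different directions after epoch 4, so the choice of "best" epoch depends on the selection criterion. The short sketch below uses only values copied from the table. That the reported 0.1858 comes from selecting by validation loss is an inference from the matching row, not something this page states.

```python
# (epoch, validation_loss, wer) copied from the training results table.
RESULTS = [
    (1, 0.4372, 0.2117),
    (2, 0.4474, 0.2044),
    (3, 0.4372, 0.1920),
    (4, 0.4285, 0.1858),
    (5, 0.4589, 0.1826),
    (6, 0.4658, 0.1774),
    (7, 0.4769, 0.1744),
    (8, 0.4604, 0.1724),
    (9, 0.4731, 0.1694),
    (10, 0.4843, 0.1683),
]

# Pick the epoch that minimizes each criterion.
best_by_loss = min(RESULTS, key=lambda row: row[1])
best_by_wer = min(RESULTS, key=lambda row: row[2])

print("lowest validation loss:", best_by_loss)
print("lowest WER:", best_by_wer)
```

Selecting by validation loss picks epoch 4 (loss 0.4285, WER 0.1858), while selecting by WER picks epoch 10 (loss 0.4843, WER 0.1683).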

Property	Details
Model Type	Fine-tuned version of gary109/ai-light-dance_singing_ft_wav2vec2-large-lv60
Tags	automatic-speech-recognition, ../AI_Light_Dance.py, generated_from_trainer
