AI-light-dance_singing_ft_wav2vec2-large-lv60 Open Source Model - Accurately Achieve Automatic Speech Recognition

Ai Light Dance Singing Ft Wav2vec2 Large Lv60

Developed by gary109

This model is a fine-tuned automatic speech recognition model based on facebook/wav2vec2-large-lv60 using the AI_LIGHT_DANCE.PY - ONSET-SINGING dataset

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Singing voice recognition #Low word error rate #Audio fine-tuning

Downloads 16

Release Time : 5/15/2022

Model Overview

A fine-tuned model for speech recognition tasks, specifically optimized for singing content

Model Features

Optimized for singing content recognition

Specifically fine-tuned for singing content, potentially outperforming general speech recognition models on singing content

Low word error rate

Achieved a word error rate (WER) of 0.2088 on the evaluation set, demonstrating good performance

Based on wav2vec2 architecture

Uses facebook's wav2vec2-large-lv60 as the base model, featuring powerful speech feature extraction capabilities

Model Capabilities

Speech recognition

Singing content recognition

Use Cases

Music-related applications

Singing content transcription

Convert singing audio into text

Word error rate 0.2088

Music education assistance

Help music learners analyze singing content

Training Loss	Epoch	Step	Validation Loss	Wer
0.7432	1.0	4422	0.8939	0.6323
0.5484	2.0	8844	0.6393	0.3557
0.3919	3.0	13266	0.5315	0.2833
0.421	4.0	17688	0.5234	0.2522
0.3957	5.0	22110	0.5125	0.2247
0.3228	6.0	26532	0.4542	0.2088
0.346	7.0	30954	0.4673	0.1997
0.1637	8.0	35376	0.4583	0.1910
0.1508	9.0	39798	0.4623	0.1837
0.1564	10.0	44220	0.4717	0.1835

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Ai Light Dance Singing Ft Wav2vec2 Large Lv60

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 ai-light-dance_singing_ft_wav2vec2-large-lv60

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License