Ai Light Dance Singing2 Ft Wav2vec2 Large Xlsr 53 5gram V3

Developed by gary109
An automatic speech recognition model fine-tuned from wav2vec2-large-xlsr-53, specializing in singing voice recognition
Downloads 97
Release Time: 6/28/2022

Model Overview

This model is a version of wav2vec2-large-xlsr-53 fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING2 dataset. It is intended primarily for singing voice recognition.

Model Features

Singing Voice Recognition Optimization
Fine-tuned specifically on singing voice, so it may outperform general speech recognition models on sung audio
5-gram Language Model Enhancement
Decoding is combined with a 5-gram language model, which likely improves recognition accuracy
Low Word Error Rate
Achieves a word error rate (WER) of 0.2256 (22.56%) on the evaluation set
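
For readers unfamiliar with the metric, WER is conventionally the word-level edit distance (substitutions, deletions, insertions) divided by the number of words in the reference transcript. The sketch below follows that standard definition. It is not the evaluation script used for this model, and the function name and example lyrics are illustrative assumptions; any text normalization applied during the actual evaluation is unknown.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into an empty hypothesis costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution, or a free match
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical lyrics: one of four reference words is missing from the hypothesis.
print(word_error_rate("twinkle twinkle little star", "twinkle little star"))
```

Under this definition, a WER of 0.2256 corresponds to roughly 23 word errors for every 100 reference words.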

Model Capabilities

Singing voice recognition
Automatic speech-to-text
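
A minimal sketch of how such a model might be called through the Hugging Face transformers ASR pipeline. The Hub identifier below is an assumption inferred from the model name and author on this page, so verify it before use. Decoding with the 5-gram language model probably also requires the pyctcdecode and kenlm packages, which is likewise an assumption. The helper names load_asr and transcribe are hypothetical.

```python
from typing import Callable, Optional

# ASSUMPTION: Hub identifier guessed from the model name and author; not stated on this page.
MODEL_ID = "gary109/ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53-5gram-v3"


def load_asr(model_id: str = MODEL_ID) -> Callable:
    """Build a speech recognition pipeline for the given model identifier."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path: str, asr: Optional[Callable] = None) -> str:
    """Return the recognized text for one audio file.

    `asr` is any callable that maps a file path to a dict with a "text" key,
    which is the shape the transformers ASR pipeline returns for a single input.
    """
    if asr is None:
        asr = load_asr()
    result = asr(audio_path)
    return result["text"].strip()
```

Passing the recognizer in as an argument lets the pipeline be loaded once and reused across many recordings, and makes the function easy to exercise with a stand-in recognizer.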

Use Cases

Music Technology
Singing Recording to Lyrics
Automatically convert singing recordings into text lyrics
Word error rate of approximately 22.56% on the evaluation set
Music Education Assistance
Help music learners analyze singing pronunciation accuracy
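
One way the music education use case could be approached is to align the recognized text against the known lyrics and list the words that differ. The sketch below uses Python's standard difflib for the alignment. It is an illustrative assumption, not a feature of the model, and the function name and lyrics are hypothetical. It compares whole words rather than phonemes, and with a word error rate of about 22.56% a flagged word may reflect a recognition error rather than a singing error.

```python
from difflib import SequenceMatcher


def lyric_mismatches(reference: str, recognized: str):
    """List (operation, expected words, recognized words) for every differing span."""
    ref = reference.lower().split()
    hyp = recognized.lower().split()
    # autojunk=False disables difflib's popularity heuristic, which targets long sequences.
    matcher = SequenceMatcher(a=ref, b=hyp, autojunk=False)
    return [
        (tag, ref[i1:i2], hyp[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"  # keep only replace, delete, and insert spans
    ]


# Hypothetical lyrics and recognizer output.
for operation, expected, heard in lyric_mismatches("let it be", "let it bee"):
    print(operation, expected, heard)
```

Each reported span could then be shown to the learner alongside the corresponding part of the recording for review.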