Ai Light Dance Singing Ft Wav2vec2 Large Xlsr 53 5gram V1
This model is an automatic speech recognition model based on wav2vec2-large-xlsr-53, fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING dataset, primarily used for singing voice recognition.
Downloads 18
Release Time : 6/18/2022
Model Overview
This is an automatic speech recognition model for singing voice recognition, fine-tuned based on the wav2vec2-large-xlsr-53 architecture, and performs exceptionally well on specific singing datasets.
Model Features
High-Accuracy Singing Recognition
Fine-tuned on the ONSET-SINGING dataset, specifically optimized for singing voice recognition
Based on wav2vec2 Architecture
Utilizes the powerful wav2vec2-large-xlsr-53 as the base model
Low Word Error Rate
Achieves a word error rate of 16.68% on the evaluation set
Model Capabilities
Singing Voice Recognition
Automatic Speech Transcription
Use Cases
Music Technology
Singing Content Transcription
Automatically converts singing recordings into text
Word error rate 16.68%
Music Education Assistance
Helps music learners analyze singing pronunciation
Featured Recommended AI Models
Š 2025AIbase