A

Ai Light Dance Singing Ft Pretrain Wav2vec2 Large Lv60

Developed by gary109
This model is an automatic speech recognition (ASR) model based on the wav2vec2-large-lv60 architecture, fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING dataset, primarily used for singing voice recognition tasks.
Downloads 22
Release Time : 6/11/2022

Model Overview

This is an automatic speech recognition model focused on singing voice recognition, fine-tuned based on the wav2vec2-large-lv60 architecture, suitable for music-related speech recognition scenarios.

Model Features

Singing voice recognition optimization
Specially fine-tuned for singing voice recognition tasks, potentially outperforming general speech recognition models in music scenarios.
Based on wav2vec2 architecture
Utilizes Facebook's wav2vec2-large-lv60 pre-trained model as the foundation, featuring powerful speech feature extraction capabilities.
Low-resource adaptation
Adapted to specific domains through fine-tuning, suitable for domain adaptation with limited data.

Model Capabilities

Singing voice recognition
Automatic speech recognition
Music content transcription

Use Cases

Music technology
Song lyrics transcription
Automatically transcribe sung songs into written lyrics
Word Error Rate (WER) approximately 0.92
Music education assistance
Help music learners identify and correct singing pronunciation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase