A

Ai Light Dance Singing Ft Wav2vec2 Large Lv60

Developed by gary109
This model is a fine-tuned automatic speech recognition model based on facebook/wav2vec2-large-lv60 using the AI_LIGHT_DANCE.PY - ONSET-SINGING dataset
Downloads 16
Release Time : 5/15/2022

Model Overview

A fine-tuned model for speech recognition tasks, specifically optimized for singing content

Model Features

Optimized for singing content recognition
Specifically fine-tuned for singing content, potentially outperforming general speech recognition models on singing content
Low word error rate
Achieved a word error rate (WER) of 0.2088 on the evaluation set, demonstrating good performance
Based on wav2vec2 architecture
Uses facebook's wav2vec2-large-lv60 as the base model, featuring powerful speech feature extraction capabilities

Model Capabilities

Speech recognition
Singing content recognition

Use Cases

Music-related applications
Singing content transcription
Convert singing audio into text
Word error rate 0.2088
Music education assistance
Help music learners analyze singing content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase