**ai - light - dance_stepmania_ft_wav2vec2 - large - xlsr - 53 - v4 Open - source Speech Recognition Model

Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V4

Developed by gary109

This model is an automatic speech recognition model fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset, based on gary109/ai-light-dance_stepmania_ft_wav2vec2-large-xlsr-53-v3.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech recognition optimization #Dance rhythm analysis #Low word error rate

Downloads 189

Release Time : 6/26/2022

Model Overview

This is an automatic speech recognition (ASR) model based on the fine-tuned wav2vec2-large-xlsr-53 architecture, primarily used for speech recognition tasks related to musical rhythm.

Model Features

Fine-tuned based on wav2vec2-large-xlsr-53

Utilizes the powerful wav2vec2-large-xlsr-53 architecture as the base model, optimized for specific tasks.

Speech recognition for musical rhythm

Specifically trained for speech recognition tasks related to musical rhythm.

Continuously improved version

This is the v4 version, further optimized based on the previous v3 version.

Model Capabilities

Speech recognition

Musical rhythm-related speech processing

Use Cases

Music games

StepMania game speech recognition

Used to recognize voice commands in the music game StepMania

Music education

Rhythm training assistance

Helps music learners identify and follow rhythm instructions

Training Loss	Epoch	Step	Validation Loss	Wer
0.9218	1.0	188	1.0718	0.6958
0.9194	2.0	376	1.0354	0.6937
0.9077	3.0	564	1.0365	0.6730
0.8956	4.0	752	1.0497	0.6727
0.877	5.0	940	1.0299	0.6694
0.8736	6.0	1128	1.0298	0.6642
0.8769	7.0	1316	1.0348	0.6584
0.8571	8.0	1504	1.0689	0.6602
0.8573	9.0	1692	1.0559	0.6549
0.8458	10.0	1880	1.0706	0.6588

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V4

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 ai-light-dance_stepmania_ft_wav2vec2-large-xlsr-53-v4

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License