ai-light-dance_stepmania_ft_wav2vec2-large-xlsr-53-v6 Open Source Model

Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V6

Developed by gary109

This is an automatic speech recognition (ASR) model: wav2vec2-large-xlsr-53 fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset.

Speech Recognition

Transformers

Open Source License: Apache-2.0

#Rhythm game speech recognition #XLSR-53 fine-tuning #Low word error rate

Downloads 160

Release Time: 6/28/2022

Model Overview

This is an automatic speech recognition (ASR) model specifically optimized for audio data in Stepmania games.

Model Features

Based on wav2vec2 architecture

Uses wav2vec2-large-xlsr-53, a wav2vec2 model pretrained on unlabeled speech in 53 languages, as the base model

Optimized for game audio

Fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset of Stepmania game audio, so it targets speech recognition in that gaming scenario

Multi-round training

Trained for 10 epochs (3,760 steps); validation loss stayed between about 1.01 and 1.08, and word error rate (WER) ended at 0.6500 after a best value of 0.6499 at epoch 7
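
To make the word error rate figure concrete, here is a minimal sketch of how WER is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name word_error_rate is hypothetical, and this is not necessarily the exact metric implementation used during training.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substitution and one deletion against a four-word reference.
print(word_error_rate("a b c d", "a x c"))  # 0.5
```

Under this definition, a WER of about 0.65 means the number of word-level edits is roughly 65% of the number of reference words.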

Model Capabilities

Speech recognition

Audio transcription

Game audio processing

Use Cases

Gaming

Stepmania game speech recognition

Recognizing voice commands in Stepmania games

Word error rate approximately 0.65

Speech transcription

Game audio transcription

Transcribing voice content in games into text
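
As a rough illustration of the transcription use case, the sketch below loads the model through the Transformers automatic-speech-recognition pipeline. The Hub identifier is an assumption built from the developer name and model name shown on this page, the helper name transcribe is hypothetical, and decoding an audio file by path additionally assumes ffmpeg is installed. The format of the returned text depends on the labels in the fine-tuning dataset, which this page does not describe.

```python
# Assumed Hugging Face Hub ID (developer name + model name); not confirmed here.
MODEL_ID = "gary109/ai-light-dance_stepmania_ft_wav2vec2-large-xlsr-53-v6"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file with the ASR pipeline (hypothetical helper)."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # The ASR pipeline returns a dict whose "text" key holds the transcript.
    return asr(audio_path)["text"]
```

A call such as transcribe("clip.wav") downloads the model weights on first use; "clip.wav" is a placeholder file name.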

Training Loss	Epoch	Step	Validation Loss	WER
0.8572	1.0	376	1.0508	0.6601
0.8671	2.0	752	1.0755	0.6581
0.8578	3.0	1128	1.0152	0.6787
0.8552	4.0	1504	1.0537	0.6557
0.8354	5.0	1880	1.0386	0.6606
0.8543	6.0	2256	1.0063	0.6580
0.8556	7.0	2632	1.0487	0.6499
0.8356	8.0	3008	1.0407	0.6549
0.8227	9.0	3384	1.0382	0.6506
0.8148	10.0	3760	1.0440	0.6500
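
The table above can be checked programmatically. The following sketch copies its rows into a string, parses them, and reports the epochs with the lowest WER and the lowest validation loss. The names EpochResult and RESULTS are hypothetical and exist only for this illustration.

```python
from typing import NamedTuple


class EpochResult(NamedTuple):
    train_loss: float
    epoch: int
    step: int
    val_loss: float
    wer: float


# Rows copied from the training results table above.
RESULTS = """\
0.8572 1.0 376 1.0508 0.6601
0.8671 2.0 752 1.0755 0.6581
0.8578 3.0 1128 1.0152 0.6787
0.8552 4.0 1504 1.0537 0.6557
0.8354 5.0 1880 1.0386 0.6606
0.8543 6.0 2256 1.0063 0.6580
0.8556 7.0 2632 1.0487 0.6499
0.8356 8.0 3008 1.0407 0.6549
0.8227 9.0 3384 1.0382 0.6506
0.8148 10.0 3760 1.0440 0.6500
"""


def parse_results(text: str) -> list[EpochResult]:
    rows = []
    for line in text.strip().splitlines():
        train_loss, epoch, step, val_loss, wer = line.split()
        rows.append(EpochResult(float(train_loss), int(float(epoch)),
                                int(step), float(val_loss), float(wer)))
    return rows


rows = parse_results(RESULTS)
best_wer = min(rows, key=lambda r: r.wer)
best_loss = min(rows, key=lambda r: r.val_loss)
print(f"lowest WER: {best_wer.wer:.4f} at epoch {best_wer.epoch}")
print(f"lowest validation loss: {best_loss.val_loss:.4f} at epoch {best_loss.epoch}")
```

The lowest WER (0.6499, epoch 7) and the lowest validation loss (1.0063, epoch 6) fall on different epochs, and the spread across epochs is small, which is consistent with the metrics having plateaued.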
