Open-Source Audio Classification Model distilhubert-finetuned-gtzan - Accurately Identify Music Genres with 91% Accuracy

Distilhubert Finetuned Gtzan

Developed by NicolasDenier

An audio classification model based on the DistilHuBERT architecture, fine-tuned on the GTZAN music genre classification dataset with 91% accuracy

Audio Classification

Transformers

Open Source License:Apache-2.0 #Music Classification #High Accuracy #Lightweight Audio Model

Downloads 17

Release Time : 7/19/2023

Model Overview

This model is a fine-tuned version of DistilHuBERT, specifically designed for music genre classification tasks. It achieves efficient audio feature extraction through the compressed HuBERT architecture and performs excellently on the GTZAN dataset.

Model Features

Efficient Compressed Architecture

Lightweight architecture based on DistilHuBERT, reducing computational resource requirements while maintaining performance

High Accuracy

Achieves 91% accuracy on the GTZAN test set, demonstrating excellent performance

Fast Training

Through fine-tuning the pre-trained model, good performance can be achieved with just 18 training epochs

Model Capabilities

Music Genre Classification

Audio Feature Extraction

Music Content Analysis

Use Cases

Music Services

Automatic Music Classification

Automatically tag uploaded music genres for music streaming platforms

Automatic classification with 91% accuracy

Music Research

Music Feature Analysis

Study the differences in audio features across different music genres

Property	Details
Model Type	Fine-tuned version of ntu-spml/distilhubert
Training Data	marsyas/gtzan
Metrics	Accuracy

Training Loss	Epoch	Step	Validation Loss	Accuracy
2.2281	1.0	112	2.1128	0.26
1.7082	2.0	225	1.6252	0.52
1.267	3.0	337	1.3100	0.54
1.1791	4.0	450	1.0496	0.71
1.1765	5.0	562	0.8928	0.74
0.5714	6.0	675	0.8298	0.77
0.4869	7.0	787	0.7145	0.79
0.4967	8.0	900	0.6990	0.82
0.8314	9.0	1012	0.5657	0.83
0.4633	10.0	1125	0.4589	0.89
0.5547	11.0	1237	0.4919	0.86
0.4827	12.0	1350	0.4069	0.92
0.324	13.0	1462	0.4634	0.87
0.5224	14.0	1575	0.4419	0.86
0.1873	15.0	1687	0.3988	0.89
0.2852	16.0	1800	0.3788	0.9
0.3169	17.0	1912	0.3526	0.89
0.4491	17.92	2016	0.3539	0.91

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Distilhubert Finetuned Gtzan

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 distilhubert-finetuned-gtzan

📚 Documentation

Model Information

Model Performance

Model Index

🔧 Technical Details

Training and Evaluation Data

Training Procedure

Training Hyperparameters

Training Results

Framework Versions

📄 License