ast-finetuned-audioset Open-source Audio Classification Model - Precisely Identify Music Genres with 92% Accuracy

Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan

Developed by Bhanu9Prakash

This is an audio classification model based on the AST (Audio Spectrogram Transformer) architecture. It was fine-tuned on the GTZAN music genre classification dataset, where it reaches 92% accuracy.

Audio Classification

Transformers

Open Source License: BSD-3-Clause

#Music classification #High accuracy #Audio feature extraction

Downloads 50

Release Time: 7/11/2023

Model Overview

This model is specifically designed for music genre classification tasks and can identify 10 different music genres. Based on the AST architecture, it was pre-trained on the AudioSet dataset and then fine-tuned on the GTZAN dataset.
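To give a sense of what "processing audio spectrograms" means for a transformer, the sketch below counts how many patches the encoder would see for one input. It assumes the settings commonly associated with the original AST release: 128 mel bins, 1024 time frames, 16x16 patches, and a stride of 10 along both axes (which is what the "10-10" in the base checkpoint name is generally understood to denote). None of these values are stated on this page, so treat them as assumptions.

```python
def patches_per_axis(size: int, patch: int = 16, stride: int = 10) -> int:
    """Number of patch positions along one axis, without padding."""
    if size < patch:
        return 0
    return (size - patch) // stride + 1


def ast_patch_count(n_mels: int = 128, n_frames: int = 1024,
                    patch: int = 16, stride: int = 10) -> int:
    """Total patches for an (n_mels x n_frames) spectrogram.

    All defaults are assumed values, not taken from this model card.
    """
    freq_positions = patches_per_axis(n_mels, patch, stride)
    time_positions = patches_per_axis(n_frames, patch, stride)
    return freq_positions * time_positions


print(ast_patch_count())  # 12 frequency positions * 101 time positions = 1212
```

Because the stride (10) is smaller than the patch size (16), neighbouring patches overlap, which is why the count is higher than a non-overlapping 16x16 grid would give.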

Model Features

High accuracy

Achieves 92% classification accuracy on the GTZAN test set.

Transformer-based architecture

Utilizes the AST (Audio Spectrogram Transformer) architecture for processing audio spectrograms.

Two-stage training

Pre-trained on the large-scale AudioSet dataset and then fine-tuned on the GTZAN dataset.
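The second training stage can be sketched as loading the AudioSet checkpoint and replacing its classification head with a 10-class one. This is a minimal sketch, not the author's training script: the hub id `MIT/ast-finetuned-audioset-10-10-0.4593` and the alphabetical genre order are assumptions, and the helper names are hypothetical.

```python
# The ten GTZAN genres; alphabetical order is assumed here.
GTZAN_GENRES = ["blues", "classical", "country", "disco", "hiphop",
                "jazz", "metal", "pop", "reggae", "rock"]


def build_label_maps(labels):
    """Create the label2id / id2label dictionaries a classification head needs."""
    label2id = {name: index for index, name in enumerate(labels)}
    id2label = {index: name for name, index in label2id.items()}
    return label2id, id2label


def load_model_for_finetuning(
        base_checkpoint="MIT/ast-finetuned-audioset-10-10-0.4593"):  # assumed hub id
    """Load the AudioSet-pretrained AST and attach a fresh 10-class head."""
    # Imported lazily so the label helpers work without transformers installed.
    from transformers import ASTForAudioClassification

    label2id, id2label = build_label_maps(GTZAN_GENRES)
    return ASTForAudioClassification.from_pretrained(
        base_checkpoint,
        num_labels=len(GTZAN_GENRES),
        label2id=label2id,
        id2label=id2label,
        # The pretrained head has a different number of outputs, so its
        # weights are discarded and re-initialised instead of raising an error.
        ignore_mismatched_sizes=True,
    )
```

Only the head is re-initialised in this sketch; the encoder weights carry over from the AudioSet stage, which is the point of the two-stage setup.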

Model Capabilities

Music genre classification

Audio feature extraction

Audio content analysis

Use Cases

Music services

Automatic music classification

Automatically classify uploaded music files for music streaming platforms.

Accurately identifies 10 music genres.
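For the upload scenario, a minimal sketch using the Transformers `audio-classification` pipeline might look like the following. The model id is an assumption pieced together from the developer and model names on this page, and `classify_file` and `top_genre` are hypothetical helper names. Decoding a file path typically requires ffmpeg to be installed.

```python
def top_genre(predictions):
    """Return (label, score) for the highest-scoring prediction.

    `predictions` is a list of {"label": ..., "score": ...} dicts, the shape
    returned by the audio-classification pipeline.
    """
    best = max(predictions, key=lambda item: item["score"])
    return best["label"], best["score"]


def classify_file(
        path,
        model_id="Bhanu9Prakash/ast-finetuned-audioset-10-10-0.4593-finetuned-gtzan"):
    """Classify one audio file; model_id is an assumed hub id."""
    # Imported lazily so top_genre can be used without transformers installed.
    from transformers import pipeline

    # In a real service, build the pipeline once and reuse it across uploads.
    classifier = pipeline("audio-classification", model=model_id)
    return top_genre(classifier(path, top_k=10))


# Example with made-up scores, to show the expected data shape:
fake_predictions = [
    {"label": "jazz", "score": 0.7},
    {"label": "blues", "score": 0.2},
    {"label": "rock", "score": 0.1},
]
print(top_genre(fake_predictions))  # ('jazz', 0.7)
```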

Music analysis

Music recommendation system

Feature extraction and classification based on music content.

Enhances the content understanding capability of recommendation systems.
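One way content features could feed a recommendation system is nearest-neighbour search over per-track embeddings. The page does not say how embeddings are obtained; a common assumption would be to take the pooled output of the AST encoder. The sketch below therefore uses toy vectors and hypothetical track names, and shows only the similarity ranking step.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, catalog, k=3):
    """Return the k catalog entries most similar to `query`, best first."""
    scored = [(track_id, cosine_similarity(query, vector))
              for track_id, vector in catalog.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Toy 2-dimensional embeddings; real ones would come from the model.
toy_catalog = {
    "track_a": [1.0, 0.0],
    "track_b": [0.0, 1.0],
    "track_c": [1.0, 1.0],
}
ranking = most_similar([1.0, 0.0], toy_catalog, k=2)
print([track_id for track_id, _ in ranking])  # ['track_a', 'track_c']
```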

Property	Details
Model Type	Fine-tuned version of ast-finetuned-audioset-10-10-0.4593 on the GTZAN dataset
Training Data	marsyas/gtzan
Metrics	Accuracy

Training Loss	Epoch	Step	Validation Loss	Accuracy
1.0687	1.0	113	0.6197	0.84
0.299	2.0	226	0.5065	0.86
0.2634	3.0	339	0.5042	0.88
0.0473	4.0	452	0.5413	0.88
0.0033	5.0	565	0.3706	0.91
0.0003	6.0	678	0.4485	0.9
0.2538	7.0	791	0.4006	0.9
0.0002	8.0	904	0.3985	0.9
0.003	9.0	1017	0.3952	0.91
0.0001	10.0	1130	0.3966	0.92
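The table is easier to read with a small script. The sketch below copies the rows above verbatim and picks out the epoch with the lowest validation loss and the epoch with the highest accuracy; the `EpochResult` type is a hypothetical helper introduced only for illustration.

```python
from typing import NamedTuple


class EpochResult(NamedTuple):
    train_loss: float
    epoch: int
    step: int
    val_loss: float
    accuracy: float


# Rows copied from the training results table above.
RESULTS = [
    EpochResult(1.0687, 1, 113, 0.6197, 0.84),
    EpochResult(0.299, 2, 226, 0.5065, 0.86),
    EpochResult(0.2634, 3, 339, 0.5042, 0.88),
    EpochResult(0.0473, 4, 452, 0.5413, 0.88),
    EpochResult(0.0033, 5, 565, 0.3706, 0.91),
    EpochResult(0.0003, 6, 678, 0.4485, 0.90),
    EpochResult(0.2538, 7, 791, 0.4006, 0.90),
    EpochResult(0.0002, 8, 904, 0.3985, 0.90),
    EpochResult(0.003, 9, 1017, 0.3952, 0.91),
    EpochResult(0.0001, 10, 1130, 0.3966, 0.92),
]

best_by_loss = min(RESULTS, key=lambda row: row.val_loss)
best_by_accuracy = max(RESULTS, key=lambda row: row.accuracy)
steps_per_epoch = RESULTS[0].step // RESULTS[0].epoch

print(best_by_loss.epoch, best_by_loss.val_loss)          # 5 0.3706
print(best_by_accuracy.epoch, best_by_accuracy.accuracy)  # 10 0.92
print(steps_per_epoch)                                    # 113
```

The two criteria disagree: validation loss bottoms out at epoch 5, while the reported 92% accuracy comes from epoch 10. Which checkpoint counts as "best" therefore depends on the selection metric.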
