Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan

Developed by nomad-ai
This is an audio classification model based on the AST (Audio Spectrogram Transformer) architecture, fine-tuned on the GTZAN music genre classification dataset.
Downloads 15
Release Time: 8/9/2023

Model Overview

This model is designed for music genre classification and distinguishes the 10 genres in the GTZAN dataset. It applies a Transformer to audio spectrograms and achieves 90% accuracy on GTZAN.
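
As a minimal sketch of how a model like this might be called, the snippet below uses the Hugging Face transformers audio-classification pipeline. The model ID is an assumption inferred from the page title and developer name and has not been verified against the Hub. The helper names top_genre and classify_file are hypothetical.

```python
from typing import Dict, List

# Assumed Hub ID, inferred from the title and developer name on this page.
# Verify that it exists before relying on it.
MODEL_ID = "nomad-ai/ast-finetuned-audioset-10-10-0.4593-finetuned-gtzan"


def top_genre(predictions: List[Dict]) -> str:
    """Return the label with the highest score from pipeline-style output."""
    if not predictions:
        raise ValueError("no predictions to choose from")
    return max(predictions, key=lambda p: p["score"])["label"]


def classify_file(path: str, model_id: str = MODEL_ID) -> str:
    """Classify one audio file and return its most likely genre."""
    # Imported lazily so top_genre works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("audio-classification", model=model_id)
    # The pipeline returns a list of {"label": ..., "score": ...} dicts.
    return top_genre(classifier(path))
```

Calling classify_file on a local audio file would download the checkpoint on first use. Decoding a file path typically also requires ffmpeg to be installed.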

Model Features

High Accuracy
Achieves 90% accuracy on the GTZAN music genre classification task.
Transformer-based Architecture
Uses the Audio Spectrogram Transformer (AST), which applies a Transformer directly to audio spectrograms to extract features.
Pre-training + Fine-tuning
Pre-trained on the AudioSet dataset and then fine-tuned on the GTZAN dataset.
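
To illustrate how a spectrogram becomes Transformer input, the sketch below counts the overlapping patches an AST-style model would cut from one spectrogram. The values are assumptions taken from the original AST configuration, not from this page: 128 mel bins, 1024 time frames, 16x16 patches, and strides of 10 in frequency and time, which is what the "10 10" in the checkpoint name appears to denote. The function name ast_patch_grid is hypothetical.

```python
def ast_patch_grid(n_mels: int = 128, n_frames: int = 1024,
                   patch: int = 16, f_stride: int = 10, t_stride: int = 10):
    """Return (freq_patches, time_patches) for a strided patch split."""
    # Standard sliding-window count: floor((size - window) / stride) + 1
    freq_patches = (n_mels - patch) // f_stride + 1
    time_patches = (n_frames - patch) // t_stride + 1
    return freq_patches, time_patches


f, t = ast_patch_grid()
print(f, t, f * t)  # 12 101 1212
```

Under these assumptions each clip is represented as a sequence of 1212 patch tokens, which the Transformer then processes with self-attention.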

Model Capabilities

Music Genre Classification
Audio Feature Extraction
Audio Content Analysis

Use Cases

Music Services
Automatic Music Classification
Automatically classify uploaded music files for music streaming platforms.
Accurately identifies 10 different music genres.
Playlist Generation
Automatically generate personalized playlists based on music genres.
Music Research
Music Style Analysis
Support musicological research by analyzing the features of different music styles.
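
The playlist use case above can be sketched as a simple grouping step over predicted genres. This is only an illustration: the track names and labels are made up, build_genre_playlists is a hypothetical helper, and real personalization would need user data that this page does not describe.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def build_genre_playlists(tracks: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group (track_name, predicted_genre) pairs into one playlist per genre."""
    playlists: Dict[str, List[str]] = defaultdict(list)
    for name, genre in tracks:
        playlists[genre].append(name)
    return dict(playlists)


# Hypothetical predictions; in practice the genres would come from the classifier.
example = [("track_a.wav", "jazz"), ("track_b.wav", "rock"), ("track_c.wav", "jazz")]
print(build_genre_playlists(example))
# {'jazz': ['track_a.wav', 'track_c.wav'], 'rock': ['track_b.wav']}
```

Keeping the grouping separate from the classifier means the same function works with labels from any source.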