Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan
This audio classification model is based on the Audio Spectrogram Transformer (AST) architecture and is fine-tuned on the GTZAN music genre dataset, where it achieves 89% accuracy.
Downloads 1
Release Time: 10/25/2024
Model Overview
An audio classification model based on the Audio Spectrogram Transformer (AST) architecture, specifically fine-tuned for music genre classification tasks
Model Features
High Accuracy
Achieves 89% accuracy on the GTZAN music classification dataset
Transformer-based Architecture
Utilizes Audio Spectrogram Transformer to process audio spectrograms
Transfer Learning
Fine-tuned from an AST checkpoint that was pre-trained on AudioSet
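To make the "Transformer-based Architecture" feature more concrete, here is a minimal sketch of how a spectrogram could be cut into overlapping patches before it reaches the transformer. The numbers are assumptions, not taken from this page: 128 mel bins, 1024 time frames, 16x16 patches, and a stride of 10 on both axes follow the commonly published AudioSet AST configuration, and the "10 10" in the model name appears to refer to those two strides. The function name `ast_patch_grid` is hypothetical.

```python
def ast_patch_grid(n_mels=128, n_frames=1024, patch=16, f_stride=10, t_stride=10):
    """Count the patches a spectrogram yields along each axis.

    All defaults are assumed values from the usual AudioSet AST setup;
    check the checkpoint's own configuration before relying on them.
    """
    # Standard sliding-window count: floor((size - window) / stride) + 1
    f_dim = (n_mels - patch) // f_stride + 1
    t_dim = (n_frames - patch) // t_stride + 1
    return f_dim, t_dim, f_dim * t_dim


if __name__ == "__main__":
    print(ast_patch_grid())  # (12, 101, 1212)
```

Under these assumptions each clip becomes a sequence of 1212 patch tokens, which is the input the transformer layers attend over.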
Model Capabilities
Music Genre Classification
Audio Feature Extraction
Spectrogram Analysis
Use Cases
Music Analysis
Automatic Music Genre Classification
Automatically identifies the music genre of audio files
Achieves 89% accuracy on the GTZAN dataset
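The genre identification described above could be sketched as follows. This is an illustration under stated assumptions rather than the model's documented usage: `rank_genres` and `classify_file` are hypothetical helper names, the label order in `GTZAN_GENRES` is assumed (the real order comes from the checkpoint's own label mapping), and `model_id` is left as a parameter because this page does not give the repository identifier. `classify_file` relies on the Hugging Face `transformers` audio-classification pipeline, which must be installed separately.

```python
import math

# The ten GTZAN genres. The ORDER is an assumption; a real checkpoint
# defines it in its own id-to-label mapping.
GTZAN_GENRES = ["blues", "classical", "country", "disco", "hiphop",
                "jazz", "metal", "pop", "reggae", "rock"]


def rank_genres(logits, labels=GTZAN_GENRES, top_k=3):
    """Softmax raw logits and return the top_k (label, probability) pairs."""
    if len(logits) != len(labels):
        raise ValueError("expected one logit per label")
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    scored = [(label, e / total) for label, e in zip(labels, exps)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]


def classify_file(path, model_id):
    """Hypothetical wrapper: run an audio-classification pipeline on one file.

    model_id must be supplied by the caller; it is not stated on this page.
    """
    from transformers import pipeline  # imported lazily; optional dependency
    classifier = pipeline("audio-classification", model=model_id)
    return classifier(path)  # list of {"score": ..., "label": ...} dicts
```

For example, `rank_genres(logits, top_k=2)` returns the two most likely genres with their probabilities, while `classify_file` leaves decoding and scoring to the pipeline.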
Audio Content Analysis
Audio Content Classification
Classifies and tags audio content
© 2025 AIbase