Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan
A
Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan
Developed by abnerh
This model is a fine-tuned version of Audio Spectrogram Transformer (AST) on the GTZAN music classification dataset for audio classification tasks, achieving an accuracy of 88%.
Downloads 2
Release Time : 10/16/2024
Model Overview
This is an audio classification model based on the Transformer architecture, specifically fine-tuned for music genre classification tasks.
Model Features
High Accuracy
Achieves 88% classification accuracy on the GTZAN dataset.
Transformer-based Architecture
Utilizes the Audio Spectrogram Transformer architecture, suitable for audio signal processing.
Pre-training + Fine-tuning
Pre-trained on the AudioSet dataset and fine-tuned on the GTZAN dataset.
Model Capabilities
Music Genre Classification
Audio Feature Extraction
Audio Signal Analysis
Use Cases
Music Analysis
Automatic Music Genre Classification
Automatically identifies the music genre of audio files.
88% accuracy
Music Recommendation System
Serves as a feature extraction component for music recommendation systems.
Audio Processing
Audio Content Analysis
Analyzes audio content features.
Featured Recommended AI Models
Š 2025AIbase