A

Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan

Developed by abnerh
This model is a fine-tuned version of Audio Spectrogram Transformer (AST) on the GTZAN music classification dataset for audio classification tasks, achieving an accuracy of 88%.
Downloads 2
Release Time : 10/16/2024

Model Overview

This is an audio classification model based on the Transformer architecture, specifically fine-tuned for music genre classification tasks.

Model Features

High Accuracy
Achieves 88% classification accuracy on the GTZAN dataset.
Transformer-based Architecture
Utilizes the Audio Spectrogram Transformer architecture, suitable for audio signal processing.
Pre-training + Fine-tuning
Pre-trained on the AudioSet dataset and fine-tuned on the GTZAN dataset.

Model Capabilities

Music Genre Classification
Audio Feature Extraction
Audio Signal Analysis

Use Cases

Music Analysis
Automatic Music Genre Classification
Automatically identifies the music genre of audio files.
88% accuracy
Music Recommendation System
Serves as a feature extraction component for music recommendation systems.
Audio Processing
Audio Content Analysis
Analyzes audio content features.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase