A

Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan

Developed by eonrad
This model is an audio classification model based on the AST architecture and fine-tuned on the GTZAN music classification dataset, achieving an accuracy of 89%
Downloads 1
Release Time : 10/25/2024

Model Overview

An audio classification model based on the Audio Spectrogram Transformer (AST) architecture, specifically fine-tuned for music genre classification tasks

Model Features

High Accuracy
Achieves 89% accuracy on the GTZAN music classification dataset
Transformer-based Architecture
Utilizes Audio Spectrogram Transformer to process audio spectrograms
Transfer Learning
Fine-tuned based on a pre-trained model from AudioSet

Model Capabilities

Music Genre Classification
Audio Feature Extraction
Spectrogram Analysis

Use Cases

Music Analysis
Automatic Music Genre Classification
Automatically identifies the music genre of audio files
Achieves 89% accuracy on the GTZAN dataset
Audio Content Analysis
Audio Content Classification
Classifies and tags audio content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase