Ast Finetuned Audioset 10 10 0.4593 Finetuned Gtzan
BSD-3-Clause
This is an audio classification model based on the Audio Spectrogram Transformer (AST) architecture, fine-tuned on the GTZAN music genre classification dataset, where it reaches 92% accuracy.
Audio Classification
Transformers
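A minimal sketch of how a model like this might be used for genre prediction through the Transformers `audio-classification` pipeline. The listing does not give the model's Hub namespace, so `MODEL_ID` below is a hypothetical placeholder that must be replaced with the real repository ID. The helper names `classify` and `top_genre` are illustrative, not part of any library, and the label names the model returns are assumed to be GTZAN genre names.

```python
from typing import Dict, List, Union

# Hypothetical placeholder: the namespace is not stated in the listing.
# Replace with the model's actual Hub repository ID before running classify().
MODEL_ID = "<namespace>/ast-finetuned-audioset-10-10-0.4593-finetuned-gtzan"

Prediction = Dict[str, Union[str, float]]


def classify(audio_path: str, model_id: str = MODEL_ID) -> List[Prediction]:
    """Run the audio-classification pipeline on one audio file.

    Returns a list of {"label": ..., "score": ...} dicts.
    Requires the transformers package, a backend such as PyTorch,
    and ffmpeg for decoding audio files given as paths.
    """
    # Imported lazily so the helper below works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("audio-classification", model=model_id)
    return classifier(audio_path)


def top_genre(predictions: List[Prediction]) -> str:
    """Return the label with the highest score from pipeline-style output."""
    if not predictions:
        raise ValueError("predictions is empty")
    best = max(predictions, key=lambda p: p["score"])
    return str(best["label"])
```

Usage, assuming a local file `song.wav` (also hypothetical): `top_genre(classify("song.wav"))` would return the predicted genre label as a string. Keeping `top_genre` separate from the model call makes the post-processing easy to check without downloading any weights.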