D

Distil Ast Audioset

Developed by bookbot
An audio classification model based on the audio spectrum transformer architecture, distilled from the original AST AudioSet model, suitable for audio classification tasks.
Downloads 917
Release Time : 3/20/2023

Model Overview

This model is a distilled version of MIT/ast-finetuned-audioset-10-10-0.4593 on the AudioSet dataset, primarily used for audio classification tasks.

Model Features

Distilled Model
By distilling the original AST AudioSet model, the number of parameters is reduced while maintaining good performance.
High-Performance Audio Classification
Performs excellently on the AudioSet dataset, achieving an F1 score of 0.4876 and ROC AUC of 0.7140.
Efficient Training
Trained using HuggingFace's PyTorch framework, supports mixed-precision training, optimizing training efficiency.

Model Capabilities

Audio Classification
Spectrum Analysis
Multi-label Classification

Use Cases

Audio Processing
Environmental Sound Classification
Used to identify and classify various environmental sounds, such as animal calls, vehicle sounds, etc.
Achieves an F1 score of 0.4876 and ROC AUC of 0.7140.
Music Classification
Used to classify music, identifying different music genres or instrument sounds.
Average precision (mAP) is 0.4743.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase