Ast Finetuned Audioset 10 10 0.4593 ONNX
This is the ONNX version of the AST (Audio Spectrogram Transformer) model, designed specifically for audio classification tasks and fine-tuned on the AudioSet dataset.
Downloads 684
Release Time : 5/1/2025
Model Overview
This model is an audio classification model based on the Transformer architecture. It processes audio by converting it into spectrograms and is suitable for various audio recognition and classification tasks.
Model Features
ONNX format
The model has been converted to the ONNX format, facilitating deployment and use on different platforms and frameworks.
Audio classification
A Transformer model specifically optimized for audio classification tasks.
Spectrogram processing
Converts audio signals into spectrograms for efficient processing.
Model Capabilities
Audio classification
Sound event detection
Audio feature extraction
Use Cases
Multimedia analysis
Sound event detection
Identify and classify specific sound events in audio.
Achieved an mAP of 0.4593 on the AudioSet dataset.
Content classification
Classify audio content, such as music, speech, environmental sounds, etc.
Intelligent monitoring
Abnormal sound detection
Detect abnormal or dangerous sounds in monitored audio.
Featured Recommended AI Models