Ast Finetuned Audioset 10 10 0.4593 ONNX

Developed by onnx-community
This is the ONNX version of the AST (Audio Spectrogram Transformer) model, designed specifically for audio classification tasks and fine-tuned on the AudioSet dataset.
Downloads 684
Release Time: 5/1/2025

Model Overview

This model is an audio classification model based on the Transformer architecture. It processes audio by converting it into spectrograms and is suitable for various audio recognition and classification tasks.

Model Features

ONNX format
The model has been converted to the ONNX format, facilitating deployment and use on different platforms and frameworks.
Audio classification
A Transformer model specifically optimized for audio classification tasks.
Spectrogram processing
Converts audio signals into spectrograms for efficient processing.
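
As a rough illustration of the spectrogram step described above, here is a minimal pure-Python sketch that frames a signal, applies a Hann window, and takes the log power of a naive DFT. The function names, frame length, and hop size are assumptions chosen for the example. The model's actual feature extractor is not described in this listing and likely differs (for instance, this sketch omits any mel filterbank and normalization).

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split a 1-D signal into overlapping frames; a tail shorter than frame_len is dropped."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def power_spectrum(frame):
    """Naive O(n^2) DFT; returns the power of bins 0..n/2."""
    n = len(frame)
    out = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        out.append(abs(acc) ** 2)
    return out


def log_spectrogram(samples, frame_len=64, hop=32, eps=1e-10):
    """Return a list of frames, each a list of log-power values per frequency bin."""
    window = hann(frame_len)
    spec = []
    for frame in frame_signal(samples, frame_len, hop):
        windowed = [x * w for x, w in zip(frame, window)]
        spec.append([math.log(p + eps) for p in power_spectrum(windowed)])
    return spec


# Toy input (assumption): 0.1 s of a 1 kHz sine sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(800)]
spec = log_spectrogram(tone)

# Bin width is sample_rate / frame_len = 125 Hz, so 1 kHz falls in bin 8.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

The result is a time-by-frequency grid, which is the kind of 2-D input a spectrogram-based Transformer consumes. A practical pipeline would use an FFT library rather than this quadratic DFT.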

Model Capabilities

Audio classification
Sound event detection
Audio feature extraction
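
To make the classification output concrete, the sketch below turns raw per-class scores (logits) into ranked labels. AudioSet clips can carry several labels at once, so the sketch scores each class independently with a sigmoid. That choice, the function names, and the label names and logit values are all assumptions for illustration, not taken from the model's configuration.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def top_k_labels(logits, labels, k=3):
    """Return the k highest-scoring (label, score) pairs, best first."""
    scores = [sigmoid(v) for v in logits]
    ranked = sorted(zip(labels, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical label names and logits, for illustration only.
labels = ["Speech", "Music", "Dog", "Siren"]
logits = [2.0, -1.0, 0.5, -3.0]
result = top_k_labels(logits, labels, k=2)
```

For sound event detection, the same per-class scores could be compared against a threshold instead of taking a fixed top k.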

Use Cases

Multimedia analysis
Sound event detection
Identify and classify specific sound events in audio.
Achieved an mAP of 0.4593 on the AudioSet dataset.
Content classification
Classify audio content, such as music, speech, environmental sounds, etc.
Intelligent monitoring
Abnormal sound detection
Detect abnormal or dangerous sounds in monitored audio.
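
For readers unfamiliar with the mAP metric quoted under sound event detection, the following sketch computes mean average precision in its common form: average precision per class, then the mean over classes. The exact evaluation protocol behind the 0.4593 figure is not stated in this listing, so treat this as a general illustration. The function names and toy data are assumptions.

```python
def average_precision(y_true, y_score):
    """AP for one class: mean of the precision at each rank where a positive appears."""
    order = sorted(range(len(y_score)), key=lambda i: y_score[i], reverse=True)
    hits = 0
    precisions = []
    for rank, i in enumerate(order, start=1):
        if y_true[i]:
            hits += 1
            precisions.append(hits / rank)
    # A class with no positive examples contributes 0.0 in this sketch.
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(true_by_class, score_by_class):
    """Mean of per-class AP values."""
    aps = [average_precision(t, s) for t, s in zip(true_by_class, score_by_class)]
    return sum(aps) / len(aps)


# Toy data (assumption): 2 classes, 4 clips each.
true_by_class = [[1, 0, 1, 0], [0, 1, 0, 0]]
score_by_class = [[0.9, 0.8, 0.7, 0.1], [0.2, 0.6, 0.9, 0.1]]
m = mean_average_precision(true_by_class, score_by_class)
```

In the toy data the first class has an AP of 5/6 and the second an AP of 1/2, giving a mean of 2/3.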