ast-finetuned-audioset-10-10-0.4593-ONNX Open Source Model - Empowering Precise Audio Classification Tasks

Ast Finetuned Audioset 10 10 0.4593 ONNX

Developed by onnx-community

This is the ONNX version of the AST (Audio Spectrogram Transformer) model, designed specifically for audio classification tasks and fine-tuned on the AudioSet dataset.

Audio Classification

Transformers

#Audio classification #ONNX optimization #Audioset fine-tuning

Downloads 684

Release Time : 5/1/2025

Model Overview

This model is an audio classification model based on the Transformer architecture. It processes audio by converting it into spectrograms and is suitable for various audio recognition and classification tasks.

Model Features

ONNX format

The model has been converted to the ONNX format, facilitating deployment and use on different platforms and frameworks.

Audio classification

A Transformer model specifically optimized for audio classification tasks.

Spectrogram processing

Converts audio signals into spectrograms for efficient processing.

Model Capabilities

Audio classification

Sound event detection

Audio feature extraction

Use Cases

Multimedia analysis

Sound event detection

Identify and classify specific sound events in audio.

Achieved an mAP of 0.4593 on the AudioSet dataset.

Content classification

Classify audio content, such as music, speech, environmental sounds, etc.

Intelligent monitoring

Abnormal sound detection

Detect abnormal or dangerous sounds in monitored audio.

Property	Details
Library Name	transformers.js
Model Type	ast-finetuned-audioset-10-10-0.4593 (ONNX)
Base Model	MIT/ast-finetuned-audioset-10-10-0.4593

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Ast Finetuned Audioset 10 10 0.4593 ONNX

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 ast-finetuned-audioset-10-10-0.4593 (ONNX)

🚀 Quick Start

📚 Documentation

Information Table