Ast Finetuned Audioset 10 10 0.4593
Audio Spectrogram Transformer (AST) model fine-tuned on the AudioSet dataset for audio classification tasks
Downloads 82
Release Time : 6/27/2023
Model Overview
This model is a variant of the Audio Spectrogram Transformer (AST) architecture, specifically fine-tuned on the AudioSet dataset, suitable for general audio classification tasks. It can recognize and classify various audio events and sound categories.
Model Features
Transformer-based Audio Processing
Uses Vision Transformer architecture to process audio spectrograms, enabling global modeling of audio signals
AudioSet Fine-tuning
Fine-tuned on the large-scale AudioSet dataset, capable of recognizing a wide range of audio events
Web Adaptation
Provides ONNX format weights that can be directly run in browsers via Transformers.js
Model Capabilities
Audio Classification
Sound Event Detection
Environmental Sound Recognition
Use Cases
Smart Home
Pet Sound Monitoring
Detects and classifies sounds made by pets (e.g., cat meows, dog barks)
Can accurately identify common pet sounds
Content Moderation
Audio Content Classification
Automatically classifies user-uploaded audio content
Featured Recommended AI Models