Ast Finetuned Audioset 10 10 0.4593 Finetuning ESC 50
This model is an audio classification model based on the AST architecture, pre-trained on the AudioSet dataset and fine-tuned on the ESC-50 dataset, achieving an accuracy of 94.64% on the evaluation set.
Downloads 24
Release Time : 12/7/2022
Model Overview
A deep learning model for audio classification, particularly suitable for environmental sound classification tasks.
Model Features
High Accuracy
Achieves a classification accuracy of 94.64% on the ESC-50 evaluation set.
Transformer-based Architecture
Uses the AST (Audio Spectrogram Transformer) architecture to process audio data.
Transfer Learning
Pre-trained on the AudioSet dataset and fine-tuned on ESC-50.
Model Capabilities
Environmental Sound Classification
Audio Feature Extraction
Sound Event Detection
Use Cases
Environmental Monitoring
Urban Sound Classification
Identify and classify various sounds in urban environments.
Can accurately recognize multiple environmental sounds such as traffic, construction, and nature.
Smart Home
Home Anomaly Sound Detection
Detect abnormal sounds in home environments, such as glass breaking or alarms.
Featured Recommended AI Models
Š 2025AIbase