A

Ast Finetuned Audioset 10 10 0.4593

Developed by Xenova
Audio Spectrogram Transformer (AST) model fine-tuned on the AudioSet dataset for audio classification tasks
Downloads 82
Release Time : 6/27/2023

Model Overview

This model is a variant of the Audio Spectrogram Transformer (AST) architecture, specifically fine-tuned on the AudioSet dataset, suitable for general audio classification tasks. It can recognize and classify various audio events and sound categories.

Model Features

Transformer-based Audio Processing
Uses Vision Transformer architecture to process audio spectrograms, enabling global modeling of audio signals
AudioSet Fine-tuning
Fine-tuned on the large-scale AudioSet dataset, capable of recognizing a wide range of audio events
Web Adaptation
Provides ONNX format weights that can be directly run in browsers via Transformers.js

Model Capabilities

Audio Classification
Sound Event Detection
Environmental Sound Recognition

Use Cases

Smart Home
Pet Sound Monitoring
Detects and classifies sounds made by pets (e.g., cat meows, dog barks)
Can accurately identify common pet sounds
Content Moderation
Audio Content Classification
Automatically classifies user-uploaded audio content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase