H

Hubert Large Audioset

Developed by ALM
A Transformer model based on the HuBERT architecture, pre-trained on the complete AudioSet dataset, suitable for general audio representation learning tasks.
Downloads 79
Release Time : 8/28/2023

Model Overview

This model is based on the HuBERT architecture and pre-trained on the diverse AudioSet dataset, capable of extracting general audio features for various audio processing tasks.

Model Features

General Audio Representation
Pre-trained on the diverse AudioSet dataset, capable of handling various audio types (speech, music, environmental sounds, etc.)
HuBERT-based Architecture
Utilizes HuBERT's self-supervised learning method to effectively capture temporal features of audio signals
Transfer Learning Friendly
Can be used as a feature extractor or fine-tuned for downstream tasks

Model Capabilities

Audio Feature Extraction
Music Classification
Acoustic Event Detection
Speech Recognition (Limited Capability)

Use Cases

Music Analysis
Music Genre Classification
Automatic music genre classification using features extracted by the model
Environmental Sound Analysis
Acoustic Event Detection
Detecting specific sound events in the environment (e.g., alarms, animal sounds)
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase