C

Cnn8rnn Audioset Sed

Developed by wsntxxn
CRNN audio event detection model pre-trained on AudioSet and fine-tuned on AudioSet-strong
Downloads 229
Release Time : 8/13/2024

Model Overview

This is a deep learning model for sound event detection that can identify specific event categories in audio, such as speech, music, or environmental sounds.

Model Features

High Temporal Resolution
The model has a 40ms temporal resolution, enabling precise detection of audio event timings.
Multi-class Recognition
Can recognize 447 different audio event categories, including various types of speech, music, and environmental sounds.
Dual Output Mode
Provides both frame-level and clip-level outputs to meet detection needs at different precision levels.

Model Capabilities

Audio Classification
Sound Event Detection
Multi-class Audio Recognition
Temporal Localization of Audio Events

Use Cases

Audio Content Analysis
Speech Detection
Detects the presence of male or female speech in audio
Outputs probability sequences for specific speech categories
Environmental Sound Monitoring
Identifies specific sound events in the environment, such as alarms or animal sounds
Marks the occurrence time and category of sound events
Media Content Analysis
Video Auto-tagging
Automatically generates content tags by analyzing the audio track in videos
Improves video content retrieval efficiency
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase