Wav2vec2 Base Music Speech Both Classification Finetuned Gtzan
Audio classification model based on wav2vec2 architecture, fine-tuned on the GTZAN dataset for music and speech classification tasks
Downloads 15
Release Time : 9/16/2023
Model Overview
This model is an audio classification model based on the wav2vec2 architecture, specifically fine-tuned for music and speech classification tasks. It achieved an accuracy of 85% on the GTZAN dataset.
Model Features
High Accuracy
Achieves 85% classification accuracy on the GTZAN dataset
Based on wav2vec2 Architecture
Utilizes the advanced wav2vec2 architecture for audio feature extraction and classification
Music/Speech Classification
Specifically optimized for music and speech classification tasks
Model Capabilities
Audio Classification
Music Recognition
Speech Recognition
Use Cases
Audio Content Analysis
Music Streaming Classification
Automatically identifies music content in audio streams
85% accuracy
Speech Content Detection
Identifies speech content in mixed audio
Featured Recommended AI Models
Š 2025AIbase