W

Wav2vec2 Base Music Speech Both Classification Finetuned Gtzan

Developed by 0bi0n3
Audio classification model based on wav2vec2 architecture, fine-tuned on the GTZAN dataset for music and speech classification tasks
Downloads 15
Release Time : 9/16/2023

Model Overview

This model is an audio classification model based on the wav2vec2 architecture, specifically fine-tuned for music and speech classification tasks. It achieved an accuracy of 85% on the GTZAN dataset.

Model Features

High Accuracy
Achieves 85% classification accuracy on the GTZAN dataset
Based on wav2vec2 Architecture
Utilizes the advanced wav2vec2 architecture for audio feature extraction and classification
Music/Speech Classification
Specifically optimized for music and speech classification tasks

Model Capabilities

Audio Classification
Music Recognition
Speech Recognition

Use Cases

Audio Content Analysis
Music Streaming Classification
Automatically identifies music content in audio streams
85% accuracy
Speech Content Detection
Identifies speech content in mixed audio
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase