Wav2vec2-base-finetuned-gtzan Open-source Audio Classification Model

Wav2vec2 Base Finetuned Gtzan

Developed by wilson-wei

This model is an audio classification model fine-tuned on the GTZAN dataset based on facebook/wav2vec2-base, primarily used for music genre classification tasks.

Audio Classification

Transformers

Open Source License:Apache-2.0 #Audio Classification #Music Genre Recognition #High Accuracy

Downloads 14

Release Time : 7/29/2023

Model Overview

An audio classification model based on the wav2vec2 architecture, fine-tuned on the GTZAN dataset, capable of recognizing 10 different music genres.

Model Features

High Accuracy

Achieves 84% accuracy on the GTZAN test set

Based on wav2vec2 Architecture

Utilizes a self-supervised learning pre-trained speech representation model

Lightweight

Based on the wav2vec2-base version, relatively small in size

Model Capabilities

Music Genre Classification

Audio Feature Extraction

Use Cases

Music Analysis

Automatic Music Genre Classification

Automatically classify music clips by genre

84% accuracy

Music Recommendation System

Serves as a feature extraction component for music recommendation systems

Training Loss	Epoch	Step	Validation Loss	Accuracy
1.9838	1.0	113	1.8627	0.37
1.6128	2.0	226	1.5998	0.48
1.0259	3.0	339	1.3821	0.57
1.2766	4.0	452	1.1708	0.66
0.6014	5.0	565	0.7257	0.77
0.5815	6.0	678	1.0738	0.68
0.7664	7.0	791	0.7244	0.8
0.2303	8.0	904	0.5838	0.84
0.4829	9.0	1017	0.5741	0.87
0.0859	10.0	1130	0.6199	0.83
0.2983	11.0	1243	0.8117	0.84
0.0642	12.0	1356	0.5938	0.88
0.0688	13.0	1469	0.9978	0.84
0.1542	14.0	1582	0.7437	0.85
0.0117	15.0	1695	0.9100	0.84
0.039	16.0	1808	0.7757	0.85
0.0661	17.0	1921	0.8879	0.84

Property	Details
Model Type	Fine - tuned wav2vec2 - base on GTZAN dataset
Training Data	marsyas/gtzan

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base Finetuned Gtzan

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-base-finetuned-gtzan

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License