Whisper Base Finetuned Gtzan
An audio classification model for music genre classification, fine-tuned from OpenAI's whisper-base model on the GTZAN dataset.
Release Time: 7/3/2023
Model Overview
This model is a fine-tuned variant of whisper-base, adapted for music genre classification. It achieves 87% accuracy on the GTZAN test set.
Model Features
High Accuracy
Achieved 87% classification accuracy on the GTZAN test set
Fine-tuned Optimization
Fine-tuned from the whisper-base model specifically for music classification
Lightweight
Relatively lightweight, since it builds on the whisper-base architecture (inferred)
Model Capabilities
Music Genre Classification
Audio Feature Extraction
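The genre classification capability could be exercised roughly as sketched below, using the Hugging Face transformers audio-classification pipeline. This is a minimal sketch under stated assumptions, not the author's documented usage: the page does not give the model's repository ID, so MODEL_ID is a hypothetical placeholder, and the helper names top_genres and classify_clip are illustrative only. The label spellings in GTZAN_GENRES are the usual GTZAN genre names; the labels the model actually emits may differ.

```python
# The ten genre classes of the GTZAN dataset (usual spellings; the model's
# own label names may differ and should be read from its config).
GTZAN_GENRES = [
    "blues", "classical", "country", "disco", "hiphop",
    "jazz", "metal", "pop", "reggae", "rock",
]

# Hypothetical placeholder: the page does not state the repository ID.
MODEL_ID = "your-namespace/whisper-base-finetuned-gtzan"


def top_genres(predictions, k=3):
    """Return the k highest-scoring (label, score) pairs.

    `predictions` is a list of {"label": str, "score": float} dicts, the
    shape returned by the audio-classification pipeline.
    """
    ranked = sorted(predictions, key=lambda p: p["score"], reverse=True)
    return [(p["label"], p["score"]) for p in ranked[:k]]


def classify_clip(audio_path, model_id=MODEL_ID, k=3):
    """Classify one audio file and return its top-k genres.

    Assumes `transformers`, a backend such as `torch`, and ffmpeg are
    installed, and that `model_id` points at a real checkpoint.
    """
    # Imported lazily so top_genres stays usable without transformers.
    from transformers import pipeline

    classifier = pipeline("audio-classification", model=model_id)
    predictions = classifier(audio_path, top_k=len(GTZAN_GENRES))
    return top_genres(predictions, k=k)
```

With a real repository ID substituted for MODEL_ID, calling classify_clip("clip.wav") would return a short list of (genre, score) pairs for that file.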
Use Cases
Music Analysis
Automatic Music Genre Classification
Classify music clips by genre
87% accuracy
Music Recommendation System
Serve as a preprocessing component for music recommendation systems
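One way the preprocessing role in a recommendation system might look is sketched below: each track's genre scores are turned into a fixed-order vector, and tracks are compared by cosine similarity. This is an illustrative assumption, not a method described on the page. The function names (genre_vector, cosine_similarity, most_similar) and the GENRES label spellings are hypothetical choices for the example, and the input is assumed to have the {"label", "score"} shape that an audio-classification pipeline returns.

```python
import math

# Usual GTZAN genre names, used here only to fix the vector order.
GENRES = [
    "blues", "classical", "country", "disco", "hiphop",
    "jazz", "metal", "pop", "reggae", "rock",
]


def genre_vector(predictions, genres=GENRES):
    """Convert [{"label", "score"}, ...] into a fixed-order score vector."""
    scores = {p["label"]: p["score"] for p in predictions}
    # Genres missing from the predictions get a score of 0.0.
    return [scores.get(g, 0.0) for g in genres]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def most_similar(query, catalog, n=2):
    """Return the ids of the n catalog tracks closest to `query`.

    `catalog` maps a track id to its genre vector.
    """
    ranked = sorted(
        catalog.items(),
        key=lambda item: cosine_similarity(query, item[1]),
        reverse=True,
    )
    return [track_id for track_id, _ in ranked[:n]]
```

A recommender would typically combine such genre vectors with other signals (listening history, metadata) rather than rely on them alone.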