W

Whisper Base Finetuned Gtzan

Developed by vineetsharma
A speech classification model fine-tuned on the GTZAN dataset based on OpenAI's whisper-base model, primarily used for music genre classification tasks.
Downloads 15
Release Time : 7/3/2023

Model Overview

This model is a variant based on the whisper-base architecture, specifically optimized for music genre classification tasks. It achieved an accuracy of 87% on the GTZAN dataset.

Model Features

High Accuracy
Achieved 87% classification accuracy on the GTZAN test set
Fine-tuned Optimization
Optimized specifically for music classification tasks based on the whisper-base model
Lightweight
Based on whisper-base architecture, relatively lightweight (inferred)

Model Capabilities

Music Genre Classification
Audio Feature Extraction

Use Cases

Music Analysis
Automatic Music Genre Classification
Classify music clips by genre
87% accuracy
Music Recommendation System
Serve as a preprocessing component for music recommendation systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase