wav2vec2-base Music and Speech Classification Model - Open Source for Precise Distinction between Music and Speech

Home

Wav2vec2 Base Music Speech Both Classification

Developed by FerhatDk

An audio classification model fine-tuned based on facebook/wav2vec2-base for distinguishing between music and speech

Audio Classification

Transformers

Open Source License:Apache-2.0 #Audio Classification #High Accuracy #Music and Speech Recognition

Downloads 20

Release Time : 7/10/2023

Model Overview

This model is a fine-tuned audio classifier based on the wav2vec2-base architecture, specifically designed to differentiate between music and speech audio content. It achieved 98.47% accuracy on the evaluation set.

Model Features

High Accuracy

Achieved 98.47% classification accuracy on the evaluation set

Based on wav2vec2 Architecture

Utilizes the wav2vec2-base pre-trained model for fine-tuning, with excellent audio feature extraction capabilities

Lightweight Training

Requires only 8 training epochs to achieve high performance

Model Capabilities

Audio Classification

Music Recognition

Speech Recognition

Use Cases

Audio Content Analysis

Automatic Music/Speech Classification

Automatically identifies whether audio content is music or speech

98.47% accuracy

Media Management

Automatic Audio Library Classification

Automatically adds music/speech tags to content in audio libraries

Training Loss	Epoch	Step	Validation Loss	Accuracy
0.9458	1.0	66	0.8468	0.7405
0.3785	2.0	132	0.2951	0.9771
0.1762	3.0	198	0.2639	0.9313
0.134	4.0	264	0.1084	0.9771
0.0782	5.0	330	0.0877	0.9771
0.0568	6.0	396	0.0912	0.9771
0.0122	7.0	462	0.4056	0.9198
0.059	8.0	528	0.0586	0.9847

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base Music Speech Both Classification

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-base_music_speech_both_classification

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License