CREMA_D_Model Open-Source Speech Emotion Recognition Model - Free to Use, with an Accuracy of 73.22% on the Evaluation Set

CREMA D Model

Developed by jdmartinev

A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving 73.22% accuracy on the evaluation set

Audio Classification

Transformers

Open Source License:Apache-2.0 #Speech Emotion Recognition #wav2vec2 Fine-tuning #High Accuracy

Downloads 21

Release Time : 5/3/2023

Model Overview

This model is a speech emotion recognition model based on the wav2vec2 architecture, capable of identifying emotion categories from speech

Model Features

High Accuracy

Achieves 73.22% accuracy on the evaluation set, outperforming random guessing

Based on wav2vec2 Architecture

Uses the proven wav2vec2-base as the base model, with strong speech feature extraction capabilities

End-to-End Training

The model can directly learn from raw speech waveforms and predict emotion categories

Model Capabilities

Speech Emotion Recognition

Speech Feature Extraction

Emotion Classification

Use Cases

Human-Computer Interaction

Smart Customer Service Emotion Analysis

Analyzes the emotional state in customer speech to help the customer service system provide more human-like responses

Mental Health

Emotional State Monitoring

Analyzes users' emotional changes through speech for mental health applications

Training Loss	Epoch	Step	Validation Loss	Accuracy
1.7381	0.99	37	1.6700	0.3359
1.4143	1.99	74	1.4013	0.4878
1.1738	2.98	111	1.1820	0.6029
1.0229	4.0	149	1.0244	0.6532
0.8688	4.99	186	0.9101	0.7036
0.7578	5.99	223	0.8787	0.7112
0.705	6.98	260	0.8292	0.7229
0.6469	8.0	298	0.8509	0.7179
0.5684	8.99	335	0.8412	0.7288
0.5611	9.93	370	0.8221	0.7322

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

CREMA D Model

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 CREMA_D_Model

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License