W

W2v Speech Emotion Recognition

Developed by Khoa
A Wav2Vec2-fine-tuned English speech emotion recognition model capable of identifying six emotional states
Downloads 147
Release Time : 8/27/2024

Model Overview

This model is specifically designed to recognize emotional states in English speech, including sadness, anger, disgust, fear, happiness, and neutrality. It is fine-tuned on the Kaggle speech emotion recognition dataset based on the Wav2Vec2 architecture.

Model Features

Multi-emotion recognition
Capable of identifying six different emotional states: sadness, anger, disgust, fear, happiness, and neutrality
High accuracy
Achieves an accuracy of 0.7435 on the test set, with particularly excellent performance in recognizing anger and neutral emotions
Based on Wav2Vec2 architecture
Leverages the powerful feature extraction capabilities of Wav2Vec2, making it suitable for speech emotion recognition tasks

Model Capabilities

English speech emotion recognition
Six-emotion classification
Audio feature extraction

Use Cases

Emotion analysis
Customer service call analysis
Analyze customer emotions in service calls
Helps identify dissatisfied customers and improve service quality
Mental health monitoring
Monitor user emotional states through speech analysis
Assists in mental health assessment and early intervention
Human-computer interaction
Smart assistant emotional response
Enables smart assistants to adjust responses based on user speech emotions
Enhances the naturalness and emotional resonance of human-computer interaction
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase