W2v Speech Emotion Recognition
W
W2v Speech Emotion Recognition
Developed by Khoa
A Wav2Vec2-fine-tuned English speech emotion recognition model capable of identifying six emotional states
Downloads 147
Release Time : 8/27/2024
Model Overview
This model is specifically designed to recognize emotional states in English speech, including sadness, anger, disgust, fear, happiness, and neutrality. It is fine-tuned on the Kaggle speech emotion recognition dataset based on the Wav2Vec2 architecture.
Model Features
Multi-emotion recognition
Capable of identifying six different emotional states: sadness, anger, disgust, fear, happiness, and neutrality
High accuracy
Achieves an accuracy of 0.7435 on the test set, with particularly excellent performance in recognizing anger and neutral emotions
Based on Wav2Vec2 architecture
Leverages the powerful feature extraction capabilities of Wav2Vec2, making it suitable for speech emotion recognition tasks
Model Capabilities
English speech emotion recognition
Six-emotion classification
Audio feature extraction
Use Cases
Emotion analysis
Customer service call analysis
Analyze customer emotions in service calls
Helps identify dissatisfied customers and improve service quality
Mental health monitoring
Monitor user emotional states through speech analysis
Assists in mental health assessment and early intervention
Human-computer interaction
Smart assistant emotional response
Enables smart assistants to adjust responses based on user speech emotions
Enhances the naturalness and emotional resonance of human-computer interaction
Featured Recommended AI Models
Š 2025AIbase