Wav2vec2 Large Xlsr 53 Th Speech Emotion Recognition 3c 10ep
A speech emotion recognition model fine-tuned from airesearch/wav2vec2-large-xlsr-53-th, achieving 85.79% accuracy on the evaluation set.
Downloads 9
Release Time: 10/14/2024
Model Overview
This model is a wav2vec2 model fine-tuned for Thai speech emotion recognition. It classifies the emotion expressed in a speech recording into one of three categories.
Model Features
High accuracy
Achieves 85.79% emotion recognition accuracy on the evaluation set
Based on pre-trained model
Fine-tuned from the powerful airesearch/wav2vec2-large-xlsr-53-th model
Optimized training
Fine-tuned for 10 epochs using a linear learning rate schedule with warm-up
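The linear schedule with warm-up mentioned above can be sketched in plain Python. This is a minimal illustration of the general technique, not the training code used for this model: the function name linear_warmup_lr is hypothetical, and the step counts and base learning rate are assumed values, since the source does not state them.

```python
def linear_warmup_lr(step, total_steps, warmup_steps, base_lr):
    """Learning rate at `step`: linear warm-up, then linear decay to zero."""
    if step < warmup_steps:
        # Warm-up phase: ramp linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: fall linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Assumed values for illustration only; the source does not report them.
total_steps = 1000
warmup_steps = 100
base_lr = 1e-4

schedule = [
    linear_warmup_lr(s, total_steps, warmup_steps, base_lr)
    for s in range(total_steps + 1)
]
```

Under these assumptions the rate starts at zero, peaks at base_lr once warm-up ends, and returns to zero at the final step. Warm-up is commonly used when fine-tuning large pre-trained models to avoid large updates in the first few steps.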
Model Capabilities
Thai speech emotion recognition
Speech feature extraction
Three-class emotion recognition
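Three-class recognition implies that the classifier produces one score per class for each audio clip. The sketch below shows how such scores could be turned into a predicted label and a confidence value. It assumes the model outputs three raw logits; the label names are placeholders because the source does not list the emotion categories, and predict_emotion is a hypothetical helper, not part of any library.

```python
import math


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_emotion(logits, labels):
    """Return the most probable label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Placeholder label names; the actual categories are not given in the source.
LABELS = ["class_0", "class_1", "class_2"]

# Made-up logits standing in for the model output on one audio clip.
example_logits = [0.2, 2.1, -0.5]
label, confidence = predict_emotion(example_logits, LABELS)
```

In practice the placeholder names would be replaced by the label mapping stored in the model's configuration.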
Use Cases
Emotion analysis
Customer service call emotion analysis
Analyze customer emotional states in service calls
Classifies emotions with 85.79% accuracy on the evaluation set
Mental health monitoring
Analyze user emotional states through speech