W

Wav2vec2 Large Xlsr 53 Th Speech Emotion Recognition 3c 10ep

Developed by Paranchai
A speech emotion recognition model fine-tuned based on airesearch/wav2vec2-large-xlsr-53-th, achieving 85.79% accuracy on the evaluation set
Downloads 9
Release Time : 10/14/2024

Model Overview

This model is a fine-tuned wav2vec2 model for Thai speech emotion recognition tasks, capable of identifying emotion categories in speech

Model Features

High accuracy
Achieves 85.79% emotion recognition accuracy on the evaluation set
Based on pre-trained model
Fine-tuned from the powerful airesearch/wav2vec2-large-xlsr-53-th model
Optimized training
Precisely tuned for 10 epochs using linear learning rate scheduling with warm-up

Model Capabilities

Thai speech emotion recognition
Speech feature extraction
Three-class emotion recognition

Use Cases

Emotion analysis
Customer service call emotion analysis
Analyze customer emotional states in service calls
Can identify 85.79% of emotion categories
Mental health monitoring
Analyze user emotional states through speech
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase