# Speech Emotion Recognition
Whisper Large V3 Msp Podcast Emotion
A speech emotion recognition model based on Whisper-Large V3, optimized for the MSP-Podcast dataset, supporting 9 emotion classifications
Audio Classification
Safetensors English
W
tiantiaf
282
3
Ast Finetuned Model
Apache-2.0
This is a fine-tuned model based on Audio Spectrogram Transformer (AST), specifically designed for emotion classification in speech audio.
Audio Classification
Transformers English

A
forwarder1121
174
0
Wavlm Large Finetuned SER
A speech emotion recognition model based on WavLM-Large, supporting English speech emotion classification.
Audio Classification English
W
JBJoyce
139
0
Speech Emotion Recognition With Openai Whisper Large V3
Apache-2.0
This project utilizes the Whisper model for speech emotion recognition, capable of classifying audio into different emotional categories such as happiness, sadness, and surprise.
Audio Classification
Transformers

S
firdhokk
7,750
33
Speechbrain Emotion Recognition Openvino
Apache-2.0
This model uses a fine-tuned wav2vec2 (base) architecture, trained on the IEMOCAP dataset for speech emotion recognition tasks.
Audio Classification English
S
psakamoori
13
0
SER Odyssey Baseline WavLM Categorical
MIT
A baseline model for speech emotion recognition based on the WavLM architecture, designed to predict 8 basic emotion categories
Audio Classification
Transformers English

S
3loi
581
8
Speech Emotion Recognition Wav2vec2 Large Xlsr 53 240304 SER Fine Tuned2.0
Apache-2.0
A speech emotion recognition model based on wav2vec2-large-xlsr-53, supporting 7 emotion classifications
Audio Classification
Transformers

S
hughlan1214
145
2
Wav2vec2 Large Xlsr 53 English Finetuned Ravdess
Apache-2.0
A speech emotion recognition model fine-tuned on the RAVDESS dataset based on the wav2vec2-large-xlsr-53-english model
Audio Classification
Transformers

W
firdho26
68
0
Wav2vec2 Audio Emotion Classification
Apache-2.0
A fine-tuned audio emotion classification model based on facebook/wav2vec2-base for analyzing emotional states in speech
Audio Classification
Transformers

W
dhanush23
15
0
Wav2vec2 Audio Emotion Classification
Apache-2.0
A fine-tuned audio emotion classification model based on facebook/wav2vec2-base, achieving 73.98% accuracy on the evaluation set
Audio Classification
Transformers

W
chin-may
77
5
Wav2vec2 Lg Xlsr En Speech Emotion Recognition Finetuned Ravdess V8
Apache-2.0
English speech emotion recognition model based on wav2vec2 architecture, fine-tuned on the RAVDESS dataset
Audio Classification
Transformers

W
Wiam
94
4
Emotion Diarization Wavlm Large
Apache-2.0
Fine-tuned using the WavLM Large model for speech emotion recognition and speaker diarization analysis, supporting multiple emotion classifications
Audio Classification English
E
speechbrain
1,128
52
Distilhubert Finetuned Ravdess
Apache-2.0
A speech emotion recognition model fine-tuned on the RAVDESS dataset based on DistilHuBERT architecture, achieving 92.36% accuracy
Audio Classification
Transformers

D
pollner
43
2
Finetuned Wav2vec2.0 Base On IEMOCAP 2
Apache-2.0
This is a speech emotion recognition model based on the facebook/wav2vec2-base model fine-tuned on the IEMOCAP dataset, achieving 73.9% accuracy on the evaluation set.
Audio Classification
Transformers

F
minoosh
32
2
CREMA D Model
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving 73.22% accuracy on the evaluation set
Audio Classification
Transformers

C
jdmartinev
21
0
Wav2vec2 Base Toronto Emotional Speech Set
Apache-2.0
An audio emotion classification model fine-tuned based on wav2vec2-base, used to identify the speaker's emotional state.
Audio Classification
Transformers English

W
DunnBC22
185
3
Astie Finetuned On Shemo
Bsd-3-clause
This model is a fine-tuned version of the AST model on the shEMO dataset, primarily used for speech emotion recognition tasks.
Audio Classification
Transformers

A
minoosh
24
0
Iewav2vec2 Finetuned On Shemo
Apache-2.0
This model is a fine-tuned version of minoosh/wav2vec2-base-finetuned-ie on the shEMO dataset, primarily used for speech emotion recognition tasks.
Audio Classification
Transformers

I
minoosh
20
0
Ser Model Adjusted 2023 03 03
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 75.73% on the evaluation set
Audio Classification
Transformers

S
aherzberg
18
0
Ser Model Fixed Label
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 83.67% on the evaluation set
Audio Classification
Transformers

S
aherzberg
18
1
Ser Model
Apache-2.0
A fine-tuned speech emotion recognition model based on facebook/wav2vec2-base, achieving 84.71% accuracy on the evaluation set
Audio Classification
Transformers

S
aherzberg
30
0
Wav2vec2 Base Finetuned Sentiment Mesd
Apache-2.0
A Spanish audio sentiment classification model fine-tuned on the MESD dataset based on facebook/wav2vec2-base
Audio Classification
Transformers

W
somosnlp-hackathon-2022
28
5
Wav2vec2 Base Superb Er
Apache-2.0
This is a speech emotion recognition model based on the Wav2Vec2 architecture, adapted from the S3PRL project, designed to identify emotional categories in speech.
Audio Classification
Transformers English

W
superb
28.14k
11
Wav2vec2 Large Superb Er
Apache-2.0
This is an emotion recognition model based on the Wav2Vec2-Large model, specifically designed to identify emotion categories from speech.
Audio Classification
Transformers English

W
superb
1,442
1
Xlsr Wav2vec Speech Emotion Recognition
Apache-2.0
A speech emotion recognition model based on the XLSR-Wav2Vec architecture, capable of identifying five basic emotions: anger, disgust, fear, happiness, and sadness.
Audio Classification
Transformers English

X
harshit345
498
62
Hubert Base Superb Er
Apache-2.0
This model is an emotion recognition model based on the Hubert-Base architecture, trained on the SUPERB emotion recognition task for speech emotion classification
Audio Classification
Transformers English

H
superb
7,887
20
Wav2vec2 Lg Xlsr En Speech Emotion Recognition
Apache-2.0
A speech emotion recognition model fine-tuned on Wav2Vec 2.0, capable of identifying 8 English emotions with an accuracy of 82.23% on the RAVDESS dataset
Audio Classification
Transformers

W
ehcalabres
39.83k
221
Hubert Large Superb Er
Apache-2.0
An emotion recognition model based on Hubert-Large pre-trained model for predicting emotion categories in speech
Audio Classification
Transformers English

H
superb
10.24k
21
Hubert Emotion
A speech emotion recognition model based on the HuBERT architecture, capable of identifying the emotional state of a speaker from audio.
Audio Classification
Transformers

H
Rajaram1996
76
32
Featured Recommended AI Models