Model Selection

Speech Emotion Recognition

# Speech Emotion Recognition

Whisper Large V3 Msp Podcast Emotion

A speech emotion recognition model based on Whisper-Large V3, optimized for the MSP-Podcast dataset, supporting 9 emotion classifications

Audio Classification

Safetensors English

Ast Finetuned Model

This is a fine-tuned model based on Audio Spectrogram Transformer (AST), specifically designed for emotion classification in speech audio.

Audio Classification

Transformers English

Wavlm Large Finetuned SER

A speech emotion recognition model based on WavLM-Large, supporting English speech emotion classification.

Audio Classification English

Speech Emotion Recognition With Openai Whisper Large V3

This project utilizes the Whisper model for speech emotion recognition, capable of classifying audio into different emotional categories such as happiness, sadness, and surprise.

Audio Classification

Speechbrain Emotion Recognition Openvino

This model uses a fine-tuned wav2vec2 (base) architecture, trained on the IEMOCAP dataset for speech emotion recognition tasks.

Audio Classification English

SER Odyssey Baseline WavLM Categorical

A baseline model for speech emotion recognition based on the WavLM architecture, designed to predict 8 basic emotion categories

Audio Classification

Transformers English

Speech Emotion Recognition Wav2vec2 Large Xlsr 53 240304 SER Fine Tuned2.0

A speech emotion recognition model based on wav2vec2-large-xlsr-53, supporting 7 emotion classifications

Audio Classification

Wav2vec2 Large Xlsr 53 English Finetuned Ravdess

A speech emotion recognition model fine-tuned on the RAVDESS dataset based on the wav2vec2-large-xlsr-53-english model

Audio Classification

Wav2vec2 Audio Emotion Classification

A fine-tuned audio emotion classification model based on facebook/wav2vec2-base for analyzing emotional states in speech

Audio Classification

Wav2vec2 Audio Emotion Classification

A fine-tuned audio emotion classification model based on facebook/wav2vec2-base, achieving 73.98% accuracy on the evaluation set

Audio Classification

Wav2vec2 Lg Xlsr En Speech Emotion Recognition Finetuned Ravdess V8

English speech emotion recognition model based on wav2vec2 architecture, fine-tuned on the RAVDESS dataset

Audio Classification

Emotion Diarization Wavlm Large

Fine-tuned using the WavLM Large model for speech emotion recognition and speaker diarization analysis, supporting multiple emotion classifications

Audio Classification English

Distilhubert Finetuned Ravdess

A speech emotion recognition model fine-tuned on the RAVDESS dataset based on DistilHuBERT architecture, achieving 92.36% accuracy

Audio Classification

Finetuned Wav2vec2.0 Base On IEMOCAP 2

This is a speech emotion recognition model based on the facebook/wav2vec2-base model fine-tuned on the IEMOCAP dataset, achieving 73.9% accuracy on the evaluation set.

Audio Classification

A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving 73.22% accuracy on the evaluation set

Audio Classification

Wav2vec2 Base Toronto Emotional Speech Set

An audio emotion classification model fine-tuned based on wav2vec2-base, used to identify the speaker's emotional state.

Audio Classification

Transformers English

Astie Finetuned On Shemo

This model is a fine-tuned version of the AST model on the shEMO dataset, primarily used for speech emotion recognition tasks.

Audio Classification

Iewav2vec2 Finetuned On Shemo

This model is a fine-tuned version of minoosh/wav2vec2-base-finetuned-ie on the shEMO dataset, primarily used for speech emotion recognition tasks.

Audio Classification

Ser Model Adjusted 2023 03 03

A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 75.73% on the evaluation set

Audio Classification

Ser Model Fixed Label

A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 83.67% on the evaluation set

Audio Classification

A fine-tuned speech emotion recognition model based on facebook/wav2vec2-base, achieving 84.71% accuracy on the evaluation set

Audio Classification

Wav2vec2 Base Finetuned Sentiment Mesd

A Spanish audio sentiment classification model fine-tuned on the MESD dataset based on facebook/wav2vec2-base

Audio Classification

somosnlp-hackathon-2022

Wav2vec2 Base Superb Er

This is a speech emotion recognition model based on the Wav2Vec2 architecture, adapted from the S3PRL project, designed to identify emotional categories in speech.

Audio Classification

Transformers English

Wav2vec2 Large Superb Er

This is an emotion recognition model based on the Wav2Vec2-Large model, specifically designed to identify emotion categories from speech.

Audio Classification

Transformers English

Xlsr Wav2vec Speech Emotion Recognition

A speech emotion recognition model based on the XLSR-Wav2Vec architecture, capable of identifying five basic emotions: anger, disgust, fear, happiness, and sadness.

Audio Classification

Transformers English

Hubert Base Superb Er

This model is an emotion recognition model based on the Hubert-Base architecture, trained on the SUPERB emotion recognition task for speech emotion classification

Audio Classification

Transformers English

Wav2vec2 Lg Xlsr En Speech Emotion Recognition

A speech emotion recognition model fine-tuned on Wav2Vec 2.0, capable of identifying 8 English emotions with an accuracy of 82.23% on the RAVDESS dataset

Audio Classification

Hubert Large Superb Er

An emotion recognition model based on Hubert-Large pre-trained model for predicting emotion categories in speech

Audio Classification

Transformers English

A speech emotion recognition model based on the HuBERT architecture, capable of identifying the emotional state of a speaker from audio.

Audio Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase