🚀 vit-Facial-Expression-Recognition
This model is a fine-tuned Vision Transformer for facial emotion recognition, leveraging multiple datasets to achieve high-accuracy emotion classification.
It is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) trained on the FER2013, MMI Facial Expression Database, and [AffectNet](https://www.kaggle.com/datasets/noamsegal/affectnet-training-data) datasets.
It achieves the following results on the evaluation set:
- Loss: 0.4503
- Accuracy: 0.8434
🚀 Quick Start
This model can be used for facial emotion recognition tasks. First, ensure you have the necessary libraries installed. You can then load the model and perform predictions on facial images.
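Below is a minimal inference sketch using the `transformers` pipeline API, assuming `transformers`, `torch`, and `Pillow` are installed (`pip install transformers torch pillow`). The repo id and image path are illustrative placeholders, not confirmed by this card.

```python
# Minimal inference sketch -- the model repo id below is an assumption;
# substitute the actual Hub id or a local checkpoint directory.
from transformers import pipeline

classifier = pipeline(
    "image-classification",
    model="motheecreator/vit-Facial-Expression-Recognition",  # assumed repo id
)

# "face.jpg" is a placeholder path to a cropped facial image.
for pred in classifier("face.jpg"):
    print(f"{pred['label']}: {pred['score']:.4f}")
```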
✨ Features
- Fine-tuned for Emotion Recognition: Specifically optimized for the task of classifying facial emotions.
- Multi-dataset Training: Trained on a combination of diverse datasets, enhancing its generalization ability.
- High Accuracy: Achieves an accuracy of 0.8434 on the evaluation set.
📚 Documentation
Model description
The vit-face-expression model is a Vision Transformer fine-tuned for the task of facial emotion recognition.
It is trained on the FER2013, MMI Facial Expression, and AffectNet datasets, which consist of facial images categorized into seven different emotions:
- Angry
- Disgust
- Fear
- Happy
- Sad
- Surprise
- Neutral
Data Preprocessing
The input images are preprocessed before being fed into the model (a sketch follows this list). The preprocessing steps include:
- Resizing: Images are resized to the model's 224x224 input size.
- Normalization: Pixel values are normalized to a specific range.
- Data Augmentation: Random transformations such as rotations, flips, and zooms are applied to augment the training dataset.
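A minimal sketch of a training-time transform implementing these steps with `torchvision` and the base checkpoint's image processor; the specific augmentation parameters (rotation range, crop scale) are assumptions, as the card does not state them.

```python
# Sketch of the described preprocessing; augmentation parameters are assumed.
from transformers import ViTImageProcessor
from torchvision import transforms

processor = ViTImageProcessor.from_pretrained("google/vit-base-patch16-224-in21k")

train_transforms = transforms.Compose([
    # Resize + zoom-like augmentation to the 224x224 ViT input size.
    transforms.RandomResizedCrop(
        (processor.size["height"], processor.size["width"]), scale=(0.8, 1.0)
    ),
    transforms.RandomHorizontalFlip(),      # random flips
    transforms.RandomRotation(degrees=15),  # assumed rotation range
    transforms.ToTensor(),
    # Normalize with the processor's mean/std for this checkpoint.
    transforms.Normalize(mean=processor.image_mean, std=processor.image_std),
])
```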
Training hyperparameters
The following hyperparameters were used during training (an equivalent configuration sketch follows the list):
- learning_rate: 3e-05
- train_batch_size: 32
- eval_batch_size: 32
- seed: 42
- gradient_accumulation_steps: 8
- total_train_batch_size: 256
- optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 1000
- num_epochs: 3
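As a sketch, these hyperparameters map onto a `transformers` `TrainingArguments` configuration roughly as follows; the output path is a placeholder, and a single training device is assumed (so 32 x 8 accumulation steps gives the total batch size of 256).

```python
# Rough TrainingArguments equivalent of the listed hyperparameters.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./vit-facial-expression",  # placeholder output path
    learning_rate=3e-5,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    gradient_accumulation_steps=8,   # 32 * 8 = 256 effective batch size
    num_train_epochs=3,
    lr_scheduler_type="cosine",
    warmup_steps=1000,
    seed=42,
    # Adam betas=(0.9, 0.999) and epsilon=1e-08 are the Trainer defaults.
)
```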
Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---------------|-------|------|-----------------|----------|
| 1.3548        | 0.17  | 100  | 0.8024          | 0.7418   |
| 1.047         | 0.34  | 200  | 0.6823          | 0.7653   |
| 0.9398        | 0.51  | 300  | 0.6264          | 0.7827   |
| 0.8618        | 0.67  | 400  | 0.5857          | 0.7973   |
| 0.8363        | 0.84  | 500  | 0.5532          | 0.8104   |
| 0.8018        | 1.01  | 600  | 0.5279          | 0.8196   |
| 0.7567        | 1.18  | 700  | 0.5110          | 0.8248   |
| 0.7521        | 1.35  | 800  | 0.5080          | 0.8259   |
| 0.741         | 1.52  | 900  | 0.5002          | 0.8271   |
| 0.7229        | 1.69  | 1000 | 0.4967          | 0.8263   |
| 0.7157        | 1.85  | 1100 | 0.4876          | 0.8326   |
| 0.6868        | 2.02  | 1200 | 0.4836          | 0.8342   |
| 0.6605        | 2.19  | 1300 | 0.4711          | 0.8384   |
| 0.6449        | 2.36  | 1400 | 0.4608          | 0.8406   |
| 0.6085        | 2.53  | 1500 | 0.4503          | 0.8434   |
| 0.6178        | 2.7   | 1600 | 0.4434          | 0.8478   |
| 0.6166        | 2.87  | 1700 | 0.4420          | 0.8486   |
Framework versions
- Transformers 4.36.0
- Pytorch 2.0.0
- Datasets 2.1.0
- Tokenizers 0.15.0