Google-ViT-Base-Patch16-224 Open-Source Cartoon Emotion Detection Model - Accurately Classify Emotions in Cartoon Images

Google Vit Base Patch16 224 Cartoon Emotion Detection

Developed by jayanta

A cartoon image emotion classification model fine-tuned from the Google Vision Transformer (ViT) base model, reaching about 88% accuracy on the test set

Image Classification

Transformers

Open Source License: Apache-2.0

#Cartoon Emotion Recognition #High-precision Classification #ViT Architecture

Downloads 25

Release Time: 1/22/2023

Model Overview

This model recognizes the emotions expressed in cartoon images. It is a ViT-based classifier fine-tuned on a custom image dataset

Model Features

High Accuracy Emotion Recognition

Achieves 88.07% accuracy and 87.83% F1 score on the test set

Based on ViT Architecture

Builds on google/vit-base-patch16-224, the base-size Vision Transformer that reads a 224x224 image as a grid of 16x16 patches

End-to-End Training

The model learns features directly from image pixels; inputs need only resizing to 224x224 and normalization, with no hand-crafted feature extraction
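The sizes in the model name (patch16, 224) determine how many patches the transformer sees per image. The following is a minimal sketch of that arithmetic, assuming RGB input and one extra classification token as in the standard ViT design; the variable names are illustrative and not taken from the model's code.

```python
# Patch arithmetic for a ViT with 224x224 inputs and 16x16 patches,
# the sizes encoded in the name google/vit-base-patch16-224.
IMAGE_SIZE = 224
PATCH_SIZE = 16
CHANNELS = 3  # assumption: RGB input


def patch_grid(image_size: int, patch_size: int) -> int:
    """Return the number of patches along one side of a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image size must be a multiple of the patch size")
    return image_size // patch_size


patches_per_side = patch_grid(IMAGE_SIZE, PATCH_SIZE)
num_patches = patches_per_side ** 2
# Each patch is flattened into one vector of raw pixel values.
values_per_patch = PATCH_SIZE * PATCH_SIZE * CHANNELS
# Assumption: one classification token is prepended to the patch sequence.
sequence_length = num_patches + 1

print(patches_per_side, num_patches, values_per_patch, sequence_length)
```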

Model Capabilities

Cartoon Image Classification

Emotion Recognition

Image Feature Extraction

Use Cases

Entertainment Applications

Cartoon Expression Analysis

Analyze the emotional state of cartoon character expressions

Can identify multiple basic emotions
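A rough sketch of how such an analysis might be run with the Hugging Face transformers image-classification pipeline. The repository id below is an assumption pieced together from the developer and model names on this page, and the emotion labels in the sample are hypothetical, since the page does not list the classes.

```python
from typing import Dict, List

# Assumption: repository id inferred from the developer and model names on this page.
MODEL_ID = "jayanta/google-vit-base-patch16-224-cartoon-emotion-detection"


def top_prediction(predictions: List[Dict]) -> Dict:
    """Return the highest-scoring entry from a list of {"label", "score"} dicts."""
    if not predictions:
        raise ValueError("no predictions to choose from")
    return max(predictions, key=lambda p: p["score"])


def classify_image(image_path: str, model_id: str = MODEL_ID) -> Dict:
    """Classify one image file and return the most likely emotion."""
    # Imported lazily so top_prediction works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("image-classification", model=model_id)
    return top_prediction(classifier(image_path))


if __name__ == "__main__":
    # Hypothetical labels and scores, shown only to illustrate the output shape.
    sample = [{"label": "happy", "score": 0.91}, {"label": "sad", "score": 0.06}]
    print(top_prediction(sample))
```

Calling classify_image("frame.png") downloads the model on first use; the file name here is a placeholder.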

Content Moderation

Children's Content Filtering

Identify emotional tendencies in cartoon content

Training Results

Training Loss	Epoch	Step	Validation Loss	Accuracy	Precision	Recall	F1
No log	0.97	8	0.9902	0.5596	0.5506	0.5596	0.5360
1.242	1.97	16	0.5157	0.8165	0.8195	0.8165	0.8132
0.4438	2.97	24	0.3871	0.8440	0.8516	0.8440	0.8446
0.1768	3.97	32	0.3531	0.8624	0.8653	0.8624	0.8585
0.0661	4.97	40	0.3780	0.8716	0.8693	0.8716	0.8674
0.0661	5.97	48	0.3747	0.8624	0.8649	0.8624	0.8632
0.0375	6.97	56	0.3760	0.8991	0.8961	0.8991	0.8971
0.0362	7.97	64	0.4092	0.8716	0.8684	0.8716	0.8681
0.0322	8.97	72	0.3499	0.8899	0.8880	0.8899	0.8888
0.029	9.97	80	0.3706	0.8807	0.8769	0.8807	0.8783
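The headline figures (88.07% accuracy, 87.83% F1) match the final row of this table rather than the best one. The short sketch below, using values copied from the table, shows which epochs score best on accuracy and on validation loss. Whether an earlier checkpoint was ever saved or published is not stated on this page.

```python
# (epoch, validation_loss, accuracy, f1) copied from the training results table.
RESULTS = [
    (0.97, 0.9902, 0.5596, 0.5360),
    (1.97, 0.5157, 0.8165, 0.8132),
    (2.97, 0.3871, 0.8440, 0.8446),
    (3.97, 0.3531, 0.8624, 0.8585),
    (4.97, 0.3780, 0.8716, 0.8674),
    (5.97, 0.3747, 0.8624, 0.8632),
    (6.97, 0.3760, 0.8991, 0.8971),
    (7.97, 0.4092, 0.8716, 0.8681),
    (8.97, 0.3499, 0.8899, 0.8888),
    (9.97, 0.3706, 0.8807, 0.8783),
]

best_by_accuracy = max(RESULTS, key=lambda row: row[2])
best_by_loss = min(RESULTS, key=lambda row: row[1])
final = RESULTS[-1]

print("best accuracy:", best_by_accuracy)
print("lowest validation loss:", best_by_loss)
print("final epoch:", final)
```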

Property	Details
Model Type	Fine-tuned version of google/vit-base-patch16-224
Training Data	Custom image dataset (loaded with the imagefolder loader)
Metrics	accuracy, precision, recall, f1
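In the training results, the Accuracy and Recall columns are identical in every row. That is what support-weighted averaging produces, although this page does not state which averaging mode was used. The following is a minimal pure-Python sketch of weighted precision, recall, and F1; the function name and the toy labels are hypothetical.

```python
from collections import Counter
from typing import Dict, Sequence


def weighted_metrics(y_true: Sequence[str], y_pred: Sequence[str]) -> Dict[str, float]:
    """Compute accuracy plus support-weighted precision, recall, and F1."""
    total = len(y_true)
    support = Counter(y_true)
    pairs = list(zip(y_true, y_pred))
    accuracy = sum(t == p for t, p in pairs) / total

    precision = recall = f1 = 0.0
    for label, count in support.items():
        true_pos = sum(t == label and p == label for t, p in pairs)
        predicted = sum(p == label for p in y_pred)
        p_c = true_pos / predicted if predicted else 0.0
        r_c = true_pos / count
        f_c = 2 * p_c * r_c / (p_c + r_c) if (p_c + r_c) else 0.0
        weight = count / total  # each class counts in proportion to its support
        precision += weight * p_c
        recall += weight * r_c
        f1 += weight * f_c

    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Hypothetical labels; the page does not list the model's emotion classes.
    truth = ["happy", "happy", "sad", "angry", "sad", "happy"]
    guess = ["happy", "sad", "sad", "angry", "happy", "happy"]
    print(weighted_metrics(truth, guess))
```

Because each class's recall is weighted by its share of the samples, the weighted recall reduces to correct predictions divided by total samples, which is the accuracy.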
