Finetuned ViT Indian Food Classification V3
F
Finetuned ViT Indian Food Classification V3
Developed by DrishtiSharma
This model is a fine-tuned image classification model based on google/vit-base-patch16-224-in21k on the Human_Action_Recognition dataset, achieving an accuracy of 93.84%.
Downloads 60
Release Time : 9/3/2022
Model Overview
This is an image classification model based on the Vision Transformer (ViT) architecture, specifically designed for recognizing Indian food categories. The model has been fine-tuned on the Human_Action_Recognition dataset and performs excellently.
Model Features
High accuracy
Achieves 93.84% accuracy on the evaluation set, demonstrating excellent performance.
Based on ViT architecture
Utilizes the advanced Vision Transformer architecture, effectively capturing global image features.
Efficient fine-tuning
Efficiently fine-tuned on a pre-trained model, saving training resources.
Model Capabilities
Image classification
Food recognition
Visual feature extraction
Use Cases
Food and beverage industry
Automatic dish recognition
Used in automatic dish classification systems for restaurants or food delivery platforms
Accurately identifies various Indian food items
Health applications
Diet recording assistance
Mobile applications that help users automatically record their dietary intake
Automatically identifies food types, simplifying the recording process
Featured Recommended AI Models
Š 2025AIbase