# ViT fine-tuned model

Vit Base Beans
Apache-2.0
An image classification model based on Google Vision Transformer (ViT) architecture, specifically fine-tuned for the beans dataset
Image Classification Transformers
V
HieuVo
49
1
Tomato Leaf Disease Classification Vit
Apache-2.0
A fine-tuned tomato leaf disease classification model based on Google Vision Transformer (ViT) architecture, achieving 99.67% accuracy on the evaluation set
Image Classification Transformers
T
wellCh4n
55
1
Facial Expression Detection
A facial expression recognition model fine-tuned based on a pre-trained model, which can effectively recognize eight different facial expressions.
Face-related Transformers
F
HardlyHumans
1,266
1
UL Base Classification
Apache-2.0
This model is a fine-tuned image classification model based on Google's ViT-base-patch16-224 on an image folder dataset, achieving 89.21% accuracy on the validation set.
Image Classification Transformers
U
sharmajai901
2,432
1
Emotion Image Classification V2
Apache-2.0
A fine-tuned emotion image classification model based on Google's ViT model, achieving an accuracy of 59.38% on the validation set.
Image Classification Transformers
E
jhoppanne
2,176
2
Cat Vs Dog Classification
Apache-2.0
An image classification model fine-tuned on the cats_vs_dogs dataset using Google's ViT model, designed to distinguish between images of cats and dogs.
Image Classification Transformers
C
kazuma313
42
1
Stool Condition Classification
Apache-2.0
Fine-tuned based on Google's ViT model for fecal image classification with an accuracy of 94.17%
Image Classification Transformers
S
hossay
110
2
Pokemon Classifier
Apache-2.0
An image classification model fine-tuned on the Pokemon classification dataset based on Google's ViT model
Image Classification Transformers
P
merve
143
2
Carmodel
Apache-2.0
A vision model fine-tuned based on google/vit-base-patch16-224, achieving an F1 score of 0.9931 on the evaluation set
Image Classification Transformers
C
TechRoC123
24
1
Watermark Detector
Apache-2.0
A vision model based on the ViT architecture for detecting watermarks in images
Image Classification Transformers
W
amrul-hzz
2,709
8
Vit Base Patch16 224 In21k Finetuned Moderation
Apache-2.0
An image classification model based on Google's Vision Transformer architecture, fine-tuned specifically for content moderation tasks, achieving 90.43% accuracy on the test set
Image Classification Transformers
V
mbehbooei
752
3
Vit Bach Demo
Apache-2.0
A vision Transformer model fine-tuned based on google/vit-base-patch16-224, suitable for image classification tasks
Image Classification Transformers
V
tcvrishank
16
0
Vit Base Patch16 224 Finetuned Main Gpu 30e Final
Apache-2.0
Fine-tuned version based on Google's ViT model, achieving 99.4% validation accuracy on image classification tasks
Image Classification Transformers
V
Gokulapriyan
38
0
Vit Base Aiornot
Apache-2.0
A vision model fine-tuned based on google/vit-base-patch16-224, specific purpose not clearly stated
Image Classification Transformers
V
ThankGod
39
0
Vit Base Patch16 224 Finetuned Flower
Apache-2.0
Vision Transformer model fine-tuned on flower image dataset based on Google's ViT model
Image Classification Transformers
V
fzaghloul
35
0
Tomato Disease Detection
Apache-2.0
A tomato disease image classification model based on Vision Transformer architecture, achieving 99.18% accuracy on the evaluation set
Image Classification Transformers
T
surprisedPikachu007
35
0
Tomato Disease Detection V2
Apache-2.0
A tomato disease image classification model based on Google Vision Transformer (ViT) architecture with 98.87% accuracy
Image Classification Transformers
T
surprisedPikachu007
16
0
Vit Base Patch16 224 In21k Plant Seedling Classification
Apache-2.0
This is an image classification model fine-tuned based on google/vit-base-patch16-224-in21k, specifically designed for plant seedling classification tasks, achieving 95.67% accuracy on the test set.
Image Classification Transformers
V
uisikdag
40
1
Vit Model Julio Test
Apache-2.0
This model is an image classification model fine-tuned on the beans dataset based on Google's ViT architecture, achieving 97.74% accuracy on the validation set.
Image Classification Transformers
V
osarez-group
18
0
Vit Base Patch16 224 In21k Eurosat
Apache-2.0
A pre-trained model based on Google's Vision Transformer (ViT) architecture, fine-tuned on the EuroSAT dataset, suitable for remote sensing image classification tasks.
Image Classification Transformers
V
ingeniou
25
0
Vit Base Patch16 224 Finetuned Og Dataset 10e
Apache-2.0
A vision Transformer model fine-tuned on a custom image dataset based on Google's ViT model, achieving an evaluation accuracy of 97.7%
Image Classification Transformers
V
Gokulapriyan
17
0
Hq Fer2013notest
Apache-2.0
An image classification model based on ViT architecture, fine-tuned on the FER2013 dataset for facial expression recognition tasks.
Image Classification Transformers
H
Piro17
37
0
Hq Fer2013
Apache-2.0
A facial expression recognition model fine-tuned based on Google's ViT model, trained on the FER2013 dataset with an accuracy of 70.22%.
Image Classification Transformers
H
Piro17
38
0
Finetuned Affecthq
Apache-2.0
An image classification model fine-tuned based on google/vit-base-patch16-224-in21k, trained on an image folder dataset with an evaluation accuracy of 71.79%.
Image Classification Transformers
F
Piro17
18
0
Vit Base Patch16 224 In21k Fog Or Smog Classification
Apache-2.0
Image classification model fine-tuned based on google/vit-base-patch16-224-in21k, achieving 91% accuracy on the test set
Image Classification Transformers
V
uisikdag
19
0
Vit Base Patch32 224 In21k Finetuned Eurosat
Apache-2.0
An image classification model based on Google's Vision Transformer (ViT) architecture, fine-tuned on the EuroSAT dataset for satellite image classification tasks
Image Classification Transformers
V
keithanpai
20
0
Vit Base Patch16 224 In21k Dog Vs Cat Image Classification
Apache-2.0
A cat and dog image classification model fine-tuned based on Google Vision Transformer (ViT) architecture, achieving 99% accuracy on the test set
Image Classification Transformers English
V
DunnBC22
20
1
Platzi Vit Model Omar Espejel
Apache-2.0
This is an image classification model fine-tuned on the beans dataset based on Google's ViT model, achieving an accuracy of 98.5%.
Image Classification Transformers
P
platzi
23
0
Vit Base Cifar10
Apache-2.0
Image classification model fine-tuned on CIFAR-10 dataset based on ViT architecture
Image Classification Transformers
V
simlaharma
39
0
Perros VS Gatos Con Vit Base Patch16 224 In21k
Apache-2.0
This model is a fine-tuned version of google/vit-base-patch16-224-in21k, designed for image classification tasks to distinguish between cats and dogs.
Image Classification Transformers
P
julenalvaro
19
0
New Vit
Apache-2.0
A vision Transformer model fine-tuned based on Google's ViT foundation model, suitable for image classification tasks
Image Classification Transformers
N
shriramkv
36
0
Fine Tuned Vit Trash Classification
Apache-2.0
Image classification model based on ViT architecture, specifically fine-tuned for trash classification tasks
Image Classification Transformers
F
Aalaa
46
1
Blossom Vit
Apache-2.0
A vision Transformer model fine-tuned based on google/vit-base-patch16-224, with unspecified specific use case and training data
Image Classification Transformers
B
taraqur
24
0
Vit Base Patch16 224 Finetuned Eurosat
Apache-2.0
Vision Transformer model based on ViT architecture, achieving 98.89% accuracy after fine-tuning on image classification tasks
Image Classification Transformers
V
Weili
32
0
Dataset Model2
Apache-2.0
An image classification model fine-tuned based on Google's ViT-base model, achieving an accuracy of 87.98% on the evaluation set.
Image Classification Transformers
D
Farideh
31
0
Vit Base Patch16 224 In21k Finetuned Cassava
Apache-2.0
Image classification model based on Google Vision Transformer (ViT) architecture, fine-tuned on image folder dataset with 87.06% accuracy
Image Classification Transformers
V
siddharth963
31
1
Vit Base Patch16 224 Wi2
Apache-2.0
Vision Transformer model fine-tuned from google/vit-base-patch16-224, suitable for image classification tasks
Image Classification Transformers
V
Imene
21
0
Vit Base Patch16 224 In21k Wr
Apache-2.0
This model is a fine-tuned Vision Transformer based on google/vit-base-patch16-224-in21k on an unknown dataset, primarily used for image classification tasks.
Image Classification Transformers
V
Imene
21
0
Vit Base Patch16 224 In21k Wwwwwi
Apache-2.0
This model is a fine-tuned Vision Transformer based on google/vit-base-patch16-224-in21k on an unknown dataset, primarily used for image classification tasks.
Image Classification Transformers
V
Imene
21
0
Vc Bantai Vit Withoutambi Adunest V1
Apache-2.0
High-precision image classification model fine-tuned based on Google's ViT-base model, achieving 91.81% accuracy on the evaluation set
Image Classification Transformers
V
AykeeSalazar
28
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase