# ImageNet-21k pretrained
Vit Large Patch16 224.orig In21k
Apache-2.0
A Vision Transformer (ViT) image classification model, pretrained on ImageNet-21k by Google Research using the JAX framework and later ported to PyTorch. It is suitable for feature extraction and fine-tuning.
Image Classification
Transformers

timm
584
2
Vit Base Patch32 224 In21k
Apache-2.0
This Vision Transformer (ViT) model is pretrained on the ImageNet-21k dataset at 224x224 resolution and is suitable for image classification tasks.
Image Classification
google
35.10k
19
Vit Large Patch16 224 In21k
Apache-2.0
A Vision Transformer model pretrained on the ImageNet-21k dataset, suitable for image feature extraction and downstream task fine-tuning.
Image Classification
google
92.63k
26
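
The model names above encode a patch size (Patch16, Patch32) and an input resolution (224). As a rough illustration of what those numbers imply, the sketch below computes how many tokens the encoder would process per image. It assumes the standard ViT scheme of non-overlapping square patches plus a single class token; the function name `vit_sequence_length` is hypothetical and is not part of any library listed here.

```python
def vit_sequence_length(image_size: int, patch_size: int, class_token: bool = True) -> int:
    """Token count for a square image split into non-overlapping square patches.

    Assumption: one optional class token is prepended, as in the standard ViT design.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2
    return num_patches + (1 if class_token else 0)


# Patch sizes taken from the model names in the listing; 224 is the input resolution.
for name, patch in [("Patch16", 16), ("Patch32", 32)]:
    print(name, vit_sequence_length(224, patch))
# Patch16 197
# Patch32 50
```

Under these assumptions, a Patch32 model sees roughly a quarter of the tokens of a Patch16 model at the same resolution, which is why it is typically cheaper to run.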