Vit Base Patch16 224 In21k Finetuned Cifar10
V
Vit Base Patch16 224 In21k Finetuned Cifar10
Developed by nielsr
This is a vision Transformer model based on Google's ViT base model, fine-tuned on the CIFAR10 dataset for image classification tasks.
Downloads 31
Release Time : 3/2/2022
Model Overview
This model is an image classification model based on the Vision Transformer (ViT) architecture, achieving 98.81% accuracy after fine-tuning on the CIFAR10 dataset.
Model Features
High Accuracy
Achieves 98.81% classification accuracy on the CIFAR10 test set.
Transformer-based Architecture
Uses the Vision Transformer (ViT) architecture, suitable for processing image data.
Fine-tuned Pre-trained Model
Fine-tuned based on the google/vit-base-patch16-224-in21k pre-trained model.
Model Capabilities
Image Classification
Visual Feature Extraction
Use Cases
Computer Vision
CIFAR10 Image Classification
Classify images in the CIFAR10 dataset.
Accuracy reaches 98.81%.
General Image Classification
Can be used for other image classification tasks of similar scale.
Featured Recommended AI Models
Š 2025AIbase