V

Vit Base Patch16 224 In21k Finetuned Cifar10

Developed by nielsr
This is a vision Transformer model based on Google's ViT base model, fine-tuned on the CIFAR10 dataset for image classification tasks.
Downloads 31
Release Time : 3/2/2022

Model Overview

This model is an image classification model based on the Vision Transformer (ViT) architecture, achieving 98.81% accuracy after fine-tuning on the CIFAR10 dataset.

Model Features

High Accuracy
Achieves 98.81% classification accuracy on the CIFAR10 test set.
Transformer-based Architecture
Uses the Vision Transformer (ViT) architecture, suitable for processing image data.
Fine-tuned Pre-trained Model
Fine-tuned based on the google/vit-base-patch16-224-in21k pre-trained model.

Model Capabilities

Image Classification
Visual Feature Extraction

Use Cases

Computer Vision
CIFAR10 Image Classification
Classify images in the CIFAR10 dataset.
Accuracy reaches 98.81%.
General Image Classification
Can be used for other image classification tasks of similar scale.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase