Vit Base Patch16 224 Wi2

Developed by Imene
Vision Transformer model fine-tuned from google/vit-base-patch16-224, suitable for image classification tasks
Downloads 21
Release Time: 9/10/2022

Model Overview

This image classification model is based on the Vision Transformer (ViT) architecture and was fine-tuned from google/vit-base-patch16-224 to improve performance on a task-specific dataset.

Model Features

Based on ViT architecture
Utilizes the Vision Transformer architecture with self-attention mechanisms for image processing
Mixed precision training
Trained with mixed_float16 precision to optimize computational efficiency
AdamW optimizer
Employs AdamWeightDecay optimizer with polynomial learning rate decay strategy
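
To make the polynomial learning rate decay concrete, here is a minimal sketch of the schedule in plain Python. It follows the usual non-cycling form (interpolate from an initial rate to an end rate over a fixed number of steps, then hold). The initial rate, end rate, step count, and power shown here are hypothetical values chosen for illustration; the actual training hyperparameters are not listed above.

```python
def polynomial_decay(step, initial_lr, end_lr, decay_steps, power=1.0):
    """Learning rate at `step` under polynomial decay (no cycling)."""
    # Clamp the step so the rate stays at end_lr once decay_steps is reached.
    step = min(max(step, 0), decay_steps)
    fraction_left = 1.0 - step / decay_steps
    return (initial_lr - end_lr) * fraction_left ** power + end_lr


# Hypothetical values, chosen only for illustration (not from the model card).
INITIAL_LR = 3e-5
END_LR = 0.0
DECAY_STEPS = 1000

for step in (0, 500, 1000, 1500):
    print(step, polynomial_decay(step, INITIAL_LR, END_LR, DECAY_STEPS))
```

With power=1.0 the decay is linear; larger powers drop the rate faster early in training.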

Model Capabilities

Image classification
Feature extraction
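
As a rough sketch of the image classification capability, the code below loads a checkpoint with the Hugging Face transformers library and returns the top predictions for one image. Several things here are assumptions rather than facts from this page: the Hub repository name "Imene/vit-base-patch16-224-wi2", the availability of TensorFlow weights (inferred from the Keras-style training settings), and an installed transformers version that still ships TensorFlow model classes. The helper names top_k_predictions and classify are illustrative.

```python
import math


def top_k_predictions(logits, id2label, k=3):
    """Turn raw class logits into the k most likely (label, probability) pairs."""
    # Numerically stable softmax: subtract the max before exponentiating.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(id2label[i], probs[i]) for i in ranked[:k]]


def classify(image_path, model_id="Imene/vit-base-patch16-224-wi2", k=3):
    """Classify one image. The default model_id is an assumed Hub repository name."""
    # Imported lazily so the helper above works without these packages installed.
    from PIL import Image
    from transformers import AutoImageProcessor, TFAutoModelForImageClassification

    processor = AutoImageProcessor.from_pretrained(model_id)
    model = TFAutoModelForImageClassification.from_pretrained(model_id)
    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="tf")
    logits = model(**inputs).logits[0].numpy().tolist()
    return top_k_predictions(logits, model.config.id2label, k)
```

Calling classify("photo.jpg") with a local image path would return a list of (label, probability) pairs, highest probability first.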

Use Cases

Computer vision
General image classification
Classifies input images
Achieves 24.91% accuracy on the validation set