V

Vit Base Patch16 224 In21k Iiii

Developed by Imene
This model is a fine-tuned Vision Transformer based on google/vit-base-patch16-224-in21k, primarily used for image classification tasks.
Downloads 21
Release Time : 9/2/2022

Model Overview

This is an image classification model based on the Vision Transformer architecture, fine-tuned on specific datasets for image recognition and classification tasks.

Model Features

ViT-based Architecture
Utilizes the Vision Transformer architecture with self-attention mechanisms for image data processing
Transfer Learning
Fine-tuned from the pre-trained vit-base-patch16-224-in21k model
Mixed Precision Training
Trained with mixed_float16 precision for improved training efficiency

Model Capabilities

Image Classification
Feature Extraction

Use Cases

Computer Vision
General Image Classification
Classify and recognize input images
Achieves 39.07% accuracy on the validation set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase