V

Vit Base Patch16 224 In21k Wwwwwi

Developed by Imene
This model is a fine-tuned Vision Transformer based on google/vit-base-patch16-224-in21k on an unknown dataset, primarily used for image classification tasks.
Downloads 21
Release Time : 9/1/2022

Model Overview

This is an image classification model based on the Vision Transformer architecture, fine-tuned for specific domain image recognition tasks.

Model Features

Vision Transformer architecture
Utilizes advanced Transformer architecture for image data processing with powerful feature extraction capabilities
Pre-trained model fine-tuning
Fine-tuned based on the google/vit-base-patch16-224-in21k pre-trained model
Mixed precision training
Uses mixed_float16 precision for training to balance computational efficiency and model accuracy

Model Capabilities

Image classification
Feature extraction
Transfer learning

Use Cases

Computer vision
General image classification
Can be used for classifying common objects and scenes
Achieved 25.4% accuracy on the validation set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase