V

Vit Base Patch16 384 Wi4

Developed by Imene
A fine-tuned Vision Transformer model based on google/vit-base-patch16-384, suitable for image classification tasks
Downloads 21
Release Time : 9/6/2022

Model Overview

This model is an image classification model based on the Vision Transformer (ViT) architecture, fine-tuned on a specific dataset for image recognition and classification tasks

Model Features

High-resolution Processing
Supports high-resolution image input of 384x384 pixels
Transfer Learning
Fine-tuned based on a pre-trained ViT model, suitable for domain-specific image classification tasks
Efficient Training
Uses mixed precision training (mixed_float16) to improve training efficiency

Model Capabilities

Image Classification
Visual Feature Extraction
Transfer Learning

Use Cases

Computer Vision
General Image Classification
Classifies input images and outputs class probabilities
Achieved 57.46% accuracy on the validation set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase