V

Vit Base Patch16 384 Wi5

Developed by Imene
This model is a fine-tuned Vision Transformer based on google/vit-base-patch16-384, primarily used for image classification tasks.
Downloads 21
Release Time : 9/6/2022

Model Overview

This is an image classification model based on the Vision Transformer architecture, using the patch16-384 configuration and fine-tuned on a specific dataset.

Model Features

High-resolution Processing
Supports input resolution of 384x384 pixels
Efficient Fine-tuning
Targeted fine-tuning on the base model to adapt to specific tasks
Mixed Precision Training
Uses mixed_float16 precision for training, balancing accuracy and efficiency

Model Capabilities

Image Classification
Visual Feature Extraction

Use Cases

Computer Vision
General Image Classification
Classifies and identifies input images
Validation accuracy 49.12%, Top-3 accuracy 73.02%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase