Vit Base Patch16 384 Wi3

Developed by Imene
Fine-tuned model based on Google Vision Transformer (ViT) architecture, suitable for image classification tasks
Downloads 21
Release Time: 9/5/2022

Model Overview

This model is a fine-tuned version of the google/vit-base-patch16-384 pre-trained model on an unknown dataset, primarily used for image classification tasks.
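The base checkpoint name encodes a 16-pixel patch size and a 384-pixel input resolution. As a rough illustration of what that means for the encoder, the sketch below counts the tokens a ViT of this shape processes per image, assuming the standard layout of non-overlapping square patches plus one classification token. The function name vit_sequence_length is hypothetical and not part of any library.

```python
def vit_sequence_length(image_size: int = 384, patch_size: int = 16) -> int:
    """Tokens per image: one per non-overlapping patch, plus one [CLS] token.

    Assumes square images and square patches (hypothetical helper).
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    return patches_per_side ** 2 + 1


print(vit_sequence_length(384, 16))  # 577 (24 x 24 patches + 1)
print(vit_sequence_length(224, 16))  # 197 (14 x 14 patches + 1)
```

The 384-pixel variant therefore handles roughly three times as many tokens per image as a 224-pixel one, which is the cost of the higher input resolution.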

Model Features

High-Resolution Processing Capability
Accepts 384x384 pixel input images
Efficient Fine-tuning
Starts from the pre-trained ViT weights rather than training from scratch, then adapts them to the target task
Mixed Precision Training
Trained with the mixed_float16 precision policy, which trades some numeric precision for faster training and lower memory use
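To illustrate the precision trade-off behind the mixed precision feature, the sketch below uses only the Python standard library to round values to half precision. It assumes that mixed_float16 refers to the Keras policy of that name, under which layers compute in float16 while variables are kept in float32. The helper to_float16 and the numbers are hypothetical and are not taken from this model's training run.

```python
import struct


def to_float16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


# Hypothetical weight and gradient update, chosen only for illustration.
weight = 1.0
update = 1e-4

# Near 1.0, adjacent float16 values are about 0.001 apart, so an update
# this small is rounded away if the weight itself is stored in float16.
print(to_float16(weight + update) == weight)  # True: the update is lost

# Kept at higher precision, the same update survives.
print(weight + update > weight)  # True
```

This is why a mixed policy keeps the variables at higher precision: the float16 arithmetic gives the speed and memory benefit, while small weight updates are not silently discarded.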

Model Capabilities

Image Classification
Visual Feature Extraction

Use Cases

Computer Vision
General Image Classification
Classify and recognize input images
Validation set accuracy 61.95%, Top-3 accuracy 82.98%
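For readers unfamiliar with the two metrics, here is a minimal sketch of how top-1 and top-3 accuracy are typically computed from per-class scores. The function top_k_accuracy and the example scores are hypothetical. They are not the evaluation code or data behind the figures reported above.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]], labels: Sequence[int], k: int = 1
) -> float:
    """Fraction of samples whose true label is among the k highest scores."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices ordered from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        hits += label in ranked[:k]
    return hits / len(labels)


# Made-up scores for four images over four classes.
scores = [
    [0.70, 0.10, 0.15, 0.05],
    [0.20, 0.50, 0.25, 0.05],
    [0.10, 0.20, 0.30, 0.40],
    [0.05, 0.05, 0.10, 0.80],
]
labels = [0, 2, 0, 3]

print(top_k_accuracy(scores, labels, k=1))  # 0.5
print(top_k_accuracy(scores, labels, k=3))  # 0.75
```

Top-3 accuracy is always at least as high as top-1, since a prediction counts as correct whenever the true class appears anywhere among the three best guesses.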