Vit Base Patch32 224 In21 Leicester Binary
A binary image classification model based on the Google Vision Transformer (ViT) architecture, fine-tuned on a specific dataset to achieve high-precision classification
Downloads 15
Release Time : 12/6/2022
Model Overview
This is a vision Transformer model based on the ViT architecture, specifically fine-tuned for binary classification tasks and performs excellently on the evaluation set (F1 score of 0.9873).
Model Features
High-precision classification
Achieves an F1 score of 0.9873 on the evaluation set, showing excellent performance
Based on ViT architecture
Adopts the Vision Transformer architecture and uses the self-attention mechanism to process images
Efficient fine-tuning
Fine-tunes based on a pre-trained model, saving training resources
Model Capabilities
Image classification
Binary classification task processing
Visual feature extraction
Use Cases
Medical image analysis
Lesion detection
Used to identify specific lesion features in medical images
Industrial quality inspection
Defective product detection
Identifies defective products on the production line
Featured Recommended AI Models
Š 2025AIbase