Google Vit Base Patch16 224 Face
A Vision Transformer model fine-tuned on an image folder dataset based on google/vit-base-patch16-224 for image classification tasks.
Downloads 18
Release Time : 1/12/2023
Model Overview
This model is an image classification model based on the Vision Transformer (ViT) architecture, fine-tuned for specific domain image recognition tasks.
Model Features
Based on ViT Architecture
Utilizes the Vision Transformer architecture with self-attention mechanisms for image data processing
Fine-Tuned Version
Fine-tuned on the base model to adapt to specific image classification tasks
Medium-Scale Model
Employs a base-scale ViT model, balancing performance and computational resource requirements
Model Capabilities
Image Classification
Feature Extraction
Visual Pattern Recognition
Use Cases
Computer Vision
Facial Image Classification
Classifies and recognizes images containing human faces
Achieves 72.49% accuracy on the evaluation set
General Image Classification
Classifies and recognizes various types of images
Featured Recommended AI Models
Š 2025AIbase