G

Google Vit Base Patch16 224 Face

Developed by jayanta
A Vision Transformer model fine-tuned on an image folder dataset based on google/vit-base-patch16-224 for image classification tasks.
Downloads 18
Release Time : 1/12/2023

Model Overview

This model is an image classification model based on the Vision Transformer (ViT) architecture, fine-tuned for specific domain image recognition tasks.

Model Features

Based on ViT Architecture
Utilizes the Vision Transformer architecture with self-attention mechanisms for image data processing
Fine-Tuned Version
Fine-tuned on the base model to adapt to specific image classification tasks
Medium-Scale Model
Employs a base-scale ViT model, balancing performance and computational resource requirements

Model Capabilities

Image Classification
Feature Extraction
Visual Pattern Recognition

Use Cases

Computer Vision
Facial Image Classification
Classifies and recognizes images containing human faces
Achieves 72.49% accuracy on the evaluation set
General Image Classification
Classifies and recognizes various types of images
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase