Vit Base Cats Vs Dogs
A Vision Transformer (ViT) model, fine-tuned from Google's ViT base model on a cats vs. dogs classification dataset, that achieves 98.83% accuracy.
Downloads 394
Release Time: 3/2/2022
Model Overview
An image classification model based on the Vision Transformer architecture, fine-tuned to distinguish between pictures of cats and dogs.
Model Features
High Accuracy
Achieves 98.83% accuracy on the cats vs. dogs classification task
Based on ViT Architecture
Uses the Vision Transformer base architecture, which processes an image as a sequence of patches through a transformer encoder to extract visual features
Lightweight Fine-tuning
Fine-tuned from a pre-trained model rather than trained from scratch, which keeps training cost low
Model Capabilities
Image Classification
Cat and Dog Recognition
Visual Feature Extraction
Use Cases
Pet Recognition
Pet Photo Classification
Automatically identifies whether the animal in the photo is a cat or a dog
Accuracy: 98.83%
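As a rough illustration of this use case, the sketch below shows one way such a classifier might be called through the Hugging Face `transformers` image-classification pipeline. This page does not give the model's repository ID, so `MODEL_ID` is a hypothetical placeholder, and the helper names `top_label` and `classify_photo` are made up for this example. The exact label strings the model returns are also an assumption.

```python
from typing import Dict, List

# Hypothetical placeholder: this page does not state the model's repository ID.
MODEL_ID = "your-namespace/vit-base-cats-vs-dogs"


def top_label(predictions: List[Dict[str, object]]) -> str:
    """Return the highest-scoring label from pipeline-style output.

    `predictions` is a list of {"label": ..., "score": ...} dicts, which is
    the shape the image-classification pipeline returns for one image.
    """
    best = max(predictions, key=lambda p: p["score"])
    return str(best["label"])


def classify_photo(image_path: str, model_id: str = MODEL_ID) -> str:
    """Classify one image file; downloads the model on first use."""
    # Imported lazily so top_label() works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("image-classification", model=model_id)
    return top_label(classifier(image_path))
```

Calling `classify_photo("photo.jpg")` with a real repository ID would return the predicted label for that file. For many photos, building the pipeline once and reusing it would avoid reloading the model on every call.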
Smart Photo Albums
Automatic Photo Classification
Helps users automatically categorize pet photos
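A photo album feature of this kind could be sketched as a simple grouping step. The example below is an assumption about how an application might organize results, not part of the model itself: `group_by_label` is a hypothetical helper, and `classify` stands for any function that maps an image path to a predicted label.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List


def group_by_label(
    paths: Iterable[str],
    classify: Callable[[str], str],
) -> Dict[str, List[str]]:
    """Group image paths into albums keyed by their predicted label.

    `classify` is any callable that returns a label for one image path,
    for example a wrapper around an image-classification model.
    """
    albums: Dict[str, List[str]] = defaultdict(list)
    for path in paths:
        albums[classify(path)].append(path)
    # Return a plain dict so callers do not create empty albums by accident.
    return dict(albums)
```

Passing the classifier in as a parameter keeps the grouping logic independent of any particular model and makes it easy to try with a stand-in function before wiring in a real one.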