V

Vit Base Patch16 224 In21k Dog Vs Cat Image Classification

Developed by DunnBC22
A cat and dog image classification model fine-tuned based on Google Vision Transformer (ViT) architecture, achieving 99% accuracy on the test set
Downloads 20
Release Time : 1/11/2023

Model Overview

This is a binary classification model for distinguishing between cats and dogs, fine-tuned based on a pre-trained ViT model, suitable for simple image classification tasks

Model Features

High accuracy
Achieves 99% accuracy and 0.9897 F1 score on cat-dog classification tasks
Based on ViT architecture
Uses the Vision Transformer base architecture, suitable for image classification tasks
Lightweight fine-tuning
Only requires 3 training epochs to achieve high performance, with a learning rate of 0.0002

Model Capabilities

Image classification
Binary classification
Animal recognition

Use Cases

Pet recognition
Cat-dog classification
Automatically identifies whether the image contains a cat or a dog
99% accuracy
Content management
Pet image classification
Automatically classifies uploaded cat and dog images for pet picture websites
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase