V

Vit Base Oxford Iiit Pets

Developed by ISxOdin
A pet breed classification model fine-tuned based on Google Vision Transformer (ViT), achieving 94.45% accuracy on the Oxford-IIIT Pets dataset
Downloads 78
Release Time : 4/1/2025

Model Overview

This model is a fine-tuned version of google/vit-base-patch16-224 on the pcuenq/oxford-pets dataset, specifically designed for image classification tasks identifying 37 different cat and dog breeds.

Model Features

High Accuracy
Achieves 94.45% classification accuracy on the Oxford-IIIT Pets dataset
Transfer Learning
Fine-tuned based on a pre-trained Vision Transformer model, effectively utilizing pre-trained knowledge
Education-Friendly
Suitable as a teaching demonstration case for transfer learning and vision model fine-tuning

Model Capabilities

Image Classification
Pet Breed Recognition
Transfer Learning Fine-tuning

Use Cases

Education
Transfer Learning Teaching
Used as a teaching case for transfer learning in computer vision courses
Pet Recognition
Pet Breed Classification
Identifies 37 different cat and dog breeds
94.45% accuracy
Model Comparison
Comparison with Zero-shot Models
Performance comparison analysis with zero-shot models like CLIP
CLIP accuracy 88.00%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase