D

Deit Small Patch16 224

Developed by facebook
DeiT is a more efficiently trained Vision Transformer model, pre-trained and fine-tuned on the ImageNet-1k dataset at 224x224 resolution, suitable for image classification tasks.
Downloads 24.53k
Release Time : 3/2/2022

Model Overview

This model is an image classification model based on the Transformer architecture, achieving data-efficient training through attention mechanisms, primarily used for 1000-class ImageNet image classification tasks.

Model Features

Data-efficient Training
Achieves more efficient training than traditional ViT through attention mechanisms, reducing data requirements
Small Model Size
Fewer parameters (22M) compared to the base model, suitable for resource-constrained scenarios
High Accuracy
Achieves 79.9% top-1 accuracy on ImageNet-1k

Model Capabilities

Image Classification
Feature Extraction

Use Cases

Computer Vision
Image Classification
Classify images into one of the 1000 ImageNet categories
79.9% top-1 accuracy
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase