E

Eva02 Tiny Patch14 224.mim In22k

Developed by timm
EVA02 is a Vision Transformer model pre-trained on ImageNet-22k through masked image modeling, suitable for image classification and feature extraction tasks.
Downloads 385
Release Time : 3/31/2023

Model Overview

The EVA02 model is a Vision Transformer that incorporates techniques such as mean pooling, SwiGLU, and rotary position embedding (ROPE), suitable for image classification and feature extraction.

Model Features

Masked image modeling pre-training
Pre-trained using EVA-CLIP as the MIM teacher, which improves the model's representation ability.
Efficient architecture design
Adopts techniques such as mean pooling, SwiGLU activation function, and rotary position embedding (ROPE) to optimize model performance.
Lightweight model
With only 5.5 million parameters, it is suitable for resource-constrained environments.

Model Capabilities

Image classification
Image feature extraction
Visual representation learning

Use Cases

Computer vision
Image classification
Can be used to classify images and support recognition of multiple categories.
Pre-trained on ImageNet-22k, with high classification accuracy.
Feature extraction
Can be used to extract deep features of images, suitable for downstream tasks such as object detection and image retrieval.
Provides high-quality image representations.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase