# Image feature extraction

Aimv2 1b Patch14 224.apple Pt
AIM-v2 is an image encoder model based on the timm library, with a scale of 1 billion parameters, suitable for image feature extraction tasks.
Image Classification Transformers
A
timm
198
0
Resnet101 Clip Gap.openai
Apache-2.0
ResNet101 image encoder based on CLIP framework, extracting image features through Global Average Pooling (GAP)
Image Classification Transformers
R
timm
104
0
Dinov2 With Registers Base Imagenet1k 1 Layer
Apache-2.0
A vision transformer model based on the Transformer architecture, trained using the DINOv2 method and introducing a register mechanism to solve the artifact problem of traditional ViT models.
Image Classification Transformers
D
facebook
693
2
Ijepa Vith16 1k
I-JEPA is a self-supervised learning method that predicts representations of other parts of an image from partial representations, without relying on predefined manual data transformations or pixel-level detail filling.
Image Classification Transformers
I
facebook
153
0
Ijepa Vith14 22k
I-JEPA is a self-supervised learning method that predicts representations of other parts of an image from partial representations, without relying on predefined manual data transformations or pixel-level detail filling.
Image Classification Transformers
I
facebook
48
0
Sscd Copy Detection
Apache-2.0
SSCD is a deep learning model for image copy detection, capable of extracting image features and performing similarity comparisons.
Image Classification Transformers
S
m3
48
1
Vit Large Patch16 224.orig In21k
Apache-2.0
A Vision Transformer (ViT) based image classification model, pretrained on ImageNet-21k by Google Research using JAX framework and later ported to PyTorch. Suitable for feature extraction and fine-tuning scenarios.
Image Classification Transformers
V
timm
584
2
Eva02 Tiny Patch14 224.mim In22k
MIT
EVA02 is a Vision Transformer model pre-trained on ImageNet-22k through masked image modeling, suitable for image classification and feature extraction tasks.
Image Classification Transformers
E
timm
385
1
Vit Small Patch16 224.dino
Apache-2.0
An image feature model based on Vision Transformer (ViT), trained using the self-supervised DINO method, suitable for image classification and feature extraction tasks.
Image Classification Transformers
V
timm
70.62k
4
Resnet34 Sketch Classifier
A sketch classifier based on the ResNet-34 architecture, fine-tuned on the TU-Berlin dataset, suitable for sketch recognition and classification tasks.
Image Classification Transformers
R
kmewhort
705
1
Vit Msn Base 4
Apache-2.0
This Vision Transformer model is pretrained using the MSN method and excels in few-shot scenarios, suitable for tasks like image classification
Image Classification Transformers
V
facebook
62
1
Vit Base Patch32 224 In21k
Apache-2.0
This Vision Transformer (ViT) model is pretrained on the ImageNet-21k dataset at 224x224 resolution, suitable for image classification tasks.
Image Classification
V
google
35.10k
19
Vit Huge Patch14 224 In21k
Apache-2.0
A Vision Transformer model pretrained on ImageNet-21k, featuring an extra-large architecture suitable for visual tasks like image classification.
Image Classification
V
google
47.78k
20
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase