Model Selection

# Self-supervised visual features

Dinov2 Small ONNX

An ONNX-format version of DINOv2-small, suitable for vision tasks.

Vit Large Patch14 Dinov2.lvd142m

A vision Transformer (ViT)-based image feature model, pre-trained on the LVD-142M dataset using the self-supervised DINOv2 method.

Image Classification

Dinov2 With Registers Small Imagenet1k 1 Layer

A Vision Transformer model trained with DINOv2 and extended with register tokens, which improve the attention mechanism, eliminate artifacts, and boost performance.

Image Classification

Dinov2.giant.patch 14

DINOv2 is a visual feature extraction model developed by the Facebook Research team that achieves powerful image representations through self-supervised learning.

Dinov2.base.patch 14

DINOv2 is a self-supervised visual feature extraction model developed by Facebook Research, capable of generating robust visual feature representations.

Dinov2.small.patch 14

DINOv2 is a visual feature extraction model developed by Facebook Research that generates robust visual features without supervised learning.

Vit Small Patch14 Reg4 Dinov2.lvd142m

A Vision Transformer (ViT) image feature model with registers, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Large Patch14 Reg4 Dinov2.lvd142m

A Vision Transformer (ViT) image feature model with registers, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Giant Patch14 Reg4 Dinov2.lvd142m

A Vision Transformer (ViT) image feature model with registers, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Base Patch14 Reg4 Dinov2.lvd142m

A Vision Transformer (ViT) image feature model with registers, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Dinov2 Small Imagenet1k 1 Layer

A small Vision Transformer model trained using the DINOv2 method, suitable for image feature extraction and classification tasks.

Image Classification

A small-scale vision Transformer model trained using the DINOv2 method, extracting image features through self-supervised learning

Image Classification

A vision Transformer model trained using the DINOv2 method for self-supervised image feature extraction

Image Classification

A vision Transformer model trained using the DINOv2 method, extracting robust visual features from massive image data through self-supervised learning

Image Classification

A Vision Transformer model trained using the DINOv2 method, extracting image features through self-supervised learning.

Image Classification

Vit Small Patch14 Dinov2.lvd142m

A Vision Transformer (ViT)-based image feature model, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Large Patch14 Dinov2.lvd142m

A self-supervised image feature model based on Vision Transformer (ViT), pre-trained using the DINOv2 method on the LVD-142M dataset, suitable for image classification and feature extraction tasks.

Image Classification

Vit Giant Patch14 Dinov2.lvd142m

A giant Vision Transformer (ViT)-based image feature extraction model, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Base Patch14 Dinov2.lvd142m

A Vision Transformer (ViT)-based image feature model, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Large Patch16 224.mae

A large-scale image feature extraction model based on the Vision Transformer (ViT), pre-trained on the ImageNet-1k dataset using the self-supervised Masked Autoencoder (MAE) method.

Image Classification

Vit Base Patch16 224.mae

A Vision Transformer (ViT)-based image feature extraction model, pre-trained on the ImageNet-1k dataset using the self-supervised Masked Autoencoder (MAE) method.

Image Classification
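
The model names above encode the patch size (for example `Patch14` or `Patch16`) and, for the register variants, the number of register tokens (`Reg4`). As a rough illustration of what those numbers mean for the input sequence, the sketch below estimates how many tokens a ViT would process for a given image size. The helper `token_count` is hypothetical and not part of any library; it assumes a square input, a single class token, and that `RegN` means N extra register tokens. The 224 and 518 pixel image sizes in the usage lines are illustrative choices, not values stated in the listing.

```python
import re


def token_count(model_name: str, image_size: int) -> dict:
    """Estimate the ViT sequence length implied by a model name (hypothetical helper).

    Assumes a square image, one class token, and that "RegN" in the name
    means N register tokens. Names that mention registers without a count
    (e.g. "With Registers") are treated as having 0 here.
    """
    patch_match = re.search(r"patch\s*(\d+)", model_name, re.IGNORECASE)
    if patch_match is None:
        raise ValueError(f"no patch size found in {model_name!r}")
    patch = int(patch_match.group(1))

    reg_match = re.search(r"reg(\d+)", model_name, re.IGNORECASE)
    registers = int(reg_match.group(1)) if reg_match else 0

    if image_size % patch != 0:
        raise ValueError(f"image size {image_size} is not a multiple of patch size {patch}")

    per_side = image_size // patch   # patches along one edge
    patches = per_side ** 2          # patches in the whole image
    return {
        "patch_size": patch,
        "patches": patches,
        "registers": registers,
        "total_tokens": patches + 1 + registers,  # +1 for the class token
    }


print(token_count("Vit Small Patch14 Reg4 Dinov2.lvd142m", 224))
# {'patch_size': 14, 'patches': 256, 'registers': 4, 'total_tokens': 261}
print(token_count("Vit Base Patch16 224.mae", 224))
# {'patch_size': 16, 'patches': 196, 'registers': 0, 'total_tokens': 197}
```

Under these assumptions, a smaller patch size or a larger input image increases the token count quadratically, which is one practical difference to keep in mind when choosing between the patch-14 and patch-16 entries.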

© 2025 AIbase