V

Vit Small Patch14 Reg4 Dinov2.lvd142m

Developed by timm
A visual Transformer (ViT) image feature model with registers, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.
Downloads 15.98k
Release Time : 10/30/2023

Model Overview

This model is primarily used for image classification and feature extraction, employing a visual Transformer architecture combined with a register mechanism to enhance performance.

Model Features

Register mechanism
Utilizes a register mechanism to enhance the performance of the visual Transformer, addressing issues in traditional ViT models.
Self-supervised pre-training
Pre-trained on the LVD-142M dataset using the DINOv2 self-supervised learning method, eliminating the need for manual annotations.
Efficient feature extraction
The model has a relatively small parameter count (22.1M) but efficiently extracts image features, making it suitable for various downstream tasks.

Model Capabilities

Image classification
Image feature extraction
Visual representation learning

Use Cases

Computer vision
Image classification
Can be used for general image classification tasks, such as identifying objects, scenes, etc.
Feature extraction
Extracts image features for downstream tasks like object detection and image retrieval.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase