
Vit Giant Patch14 Reg4 Dinov2.lvd142m

Developed by timm
A Vision Transformer (ViT) image feature model with registers, pretrained on the LVD-142M dataset with the self-supervised DINOv2 method.
Downloads 917
Release Time: 10/30/2023

Model Overview

This model is a Vision Transformer backbone pretrained on a large dataset through self-supervised learning. It is primarily used for feature extraction, and its features can also support image classification.

Model Features

Register enhancement
The model adds register tokens (the "Reg4" in its name indicates four): extra learnable tokens in the input sequence, intended to improve the performance and stability of the Vision Transformer.
Self-supervised learning
Pretrained on the LVD-142M dataset using the DINOv2 self-supervised learning method.
Large-scale pretraining
Pretraining on the large-scale LVD-142M dataset gives the model robust, general-purpose feature extraction capabilities.
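
To make the register idea concrete, the sketch below counts the tokens the transformer would process for one image. It assumes the sequence is one class token, then the register tokens, then one token per 14x14 patch, as the "Patch14" and "Reg4" parts of the model name suggest. The 518-pixel input size is an assumption not stated on this page, and token_layout is a hypothetical helper, not part of any library.

```python
def token_layout(image_size: int, patch_size: int = 14, num_registers: int = 4) -> dict:
    """Count the tokens a ViT with registers sees for a square image.

    Assumed layout: [class token] + [register tokens] + [patch tokens].
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be a multiple of patch_size")
    grid = image_size // patch_size      # patches along one side of the image
    patch_tokens = grid * grid           # one token per non-overlapping patch
    prefix_tokens = 1 + num_registers    # class token plus register tokens
    return {
        "grid": grid,
        "patch_tokens": patch_tokens,
        "prefix_tokens": prefix_tokens,
        "total_tokens": prefix_tokens + patch_tokens,
    }


# 518 is an assumed input resolution, chosen because it divides evenly by 14.
layout = token_layout(518)
print(layout)
```

Under these assumptions the registers add only four tokens to a sequence of well over a thousand, so their cost is small relative to the patch tokens.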

Model Capabilities

Image feature extraction
Image classification
Visual representation learning
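
A minimal sketch of feature extraction with the timm library follows. The model identifier is inferred from the page title and should be checked against the published model name. extract_features is a hypothetical helper, and the heavy imports sit inside it so the file can be loaded without downloading the (very large) weights.

```python
# Identifier inferred from the page title; verify it before use.
MODEL_NAME = "vit_giant_patch14_reg4_dinov2.lvd142m"


def extract_features(image_path: str):
    """Return (token_features, pooled_features) for a single image file."""
    # Local imports: timm, torch and Pillow are only needed when this runs.
    import timm
    import torch
    from PIL import Image

    # num_classes=0 asks timm for a backbone without a classifier layer.
    model = timm.create_model(MODEL_NAME, pretrained=True, num_classes=0)
    model.eval()

    # Build the preprocessing pipeline that matches the pretrained weights.
    data_config = timm.data.resolve_model_data_config(model)
    transform = timm.data.create_transform(**data_config, is_training=False)

    image = Image.open(image_path).convert("RGB")
    batch = transform(image).unsqueeze(0)  # add a batch dimension

    with torch.no_grad():
        tokens = model.forward_features(batch)                # per-token features
        pooled = model.forward_head(tokens, pre_logits=True)  # one vector per image
    return tokens, pooled
```

Calling extract_features("photo.jpg") (the file name is illustrative) would download the pretrained weights on first use. The pooled vector is the usual input for a downstream classifier, while the per-token output suits dense tasks.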

Use Cases

Computer vision
Image classification
Can be used to classify images across multiple categories, typically with a classifier trained on the extracted features.
Reported to perform well on multiple benchmark datasets.
Feature extraction
Can serve as a feature extractor for downstream vision tasks.
Extracted features can feed tasks such as object detection and image segmentation.
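
As one illustration of building a classifier on frozen features, the sketch below implements a nearest-centroid classifier with cosine similarity. This is a generic technique, not a method described on this page, and the three-dimensional vectors and the "cat"/"dog" labels are toy stand-ins for real image embeddings.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fit_centroids(features, labels):
    """Average the normalized feature vectors of each class."""
    sums, counts = {}, {}
    for vec, label in zip(features, labels):
        vec = l2_normalize(vec)
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + x for s, x in zip(sums[label], vec)]
        counts[label] += 1
    return {
        label: l2_normalize([s / counts[label] for s in sums[label]])
        for label in sums
    }


def predict(centroids, vec):
    """Return the label whose centroid has the highest cosine similarity."""
    vec = l2_normalize(vec)
    return max(
        centroids,
        key=lambda label: sum(a * b for a, b in zip(centroids[label], vec)),
    )


# Toy stand-ins for embeddings; real features would come from the model.
train_features = [[1.0, 0.1, 0.0], [0.9, 0.2, 0.0], [0.0, 1.0, 0.1], [0.1, 0.9, 0.0]]
train_labels = ["cat", "cat", "dog", "dog"]

centroids = fit_centroids(train_features, train_labels)
print(predict(centroids, [0.8, 0.3, 0.0]))
```

The backbone stays frozen in this setup, so adding a new category only requires computing one more centroid.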