Dinov2 With Registers Base
D

Dinov2 With Registers Base

Developed by facebook
A vision Transformer model trained with DINOv2, optimized with register tokens to enhance attention mechanisms and improve feature extraction capabilities
Downloads 22.74k
Release Time : 12/20/2024

Model Overview

This is a base version of Vision Transformer (ViT) with registers, trained using the DINOv2 self-supervised method, capable of extracting high-quality feature representations from images for various computer vision tasks.

Model Features

Register mechanism
Eliminates attention map artifacts by adding dedicated register tokens, resulting in clearer attention distributions
Self-supervised learning
Trained using the DINOv2 method, capable of learning meaningful image feature representations without labeled data
Attention optimization
Improved attention mechanism provides more interpretable attention maps, aiding in understanding model decision processes

Model Capabilities

Image feature extraction
Self-supervised learning
Foundation model for computer vision tasks

Use Cases

Computer vision
Image classification
Can serve as a foundation model with added classification heads for image classification tasks
Object detection
Extracted image features can be used for object detection tasks
Image similarity calculation
Utilizes extracted feature vectors to compute similarity between images
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase