D

Dinov2 Base

Developed by facebook
Vision Transformer model trained using the DINOv2 method, extracting image features through self-supervised learning
Downloads 1.9M
Release Time : 7/17/2023

Model Overview

This model is a vision model based on the Transformer architecture, pre-trained on large-scale image data in a self-supervised manner, and can be used to extract image features to support downstream vision tasks.

Model Features

Self-supervised learning
Learns visual features automatically from large-scale image data without manual annotation
Robust feature extraction
Capable of extracting general image features suitable for various downstream tasks
Transformer architecture
Utilizes advanced Vision Transformer architecture to process image data

Model Capabilities

Image feature extraction
Visual representation learning
Image semantic understanding

Use Cases

Computer vision
Image classification
Fine-tune by adding a classification head on top of the pre-trained model
Object detection
Used as a feature extractor for object detection tasks
Image similarity calculation
Compute image similarity using extracted feature vectors
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase