D

Data2vec Vision Large

Developed by facebook
Data2Vec-Vision is a self-supervised learning model based on the BEiT architecture, pre-trained on the ImageNet-1k dataset, suitable for image classification tasks.
Downloads 225
Release Time : 4/14/2022

Model Overview

This model adopts the data2vec framework, learning latent representations of image data through self-distillation, supporting computer vision tasks.

Model Features

General self-supervised learning framework
Uses a unified data2vec framework to handle multimodal tasks, including speech, vision, and language.
Contextual latent representation prediction
Unlike predicting local features, the model predicts contextual representations that contain complete input information.
High-performance results
Achieves new state-of-the-art or competitive performance with mainstream methods in multiple benchmarks.

Model Capabilities

Image classification
Visual feature extraction

Use Cases

Computer vision
Image classification
Classifies images into 1000 categories
Performs excellently on the ImageNet-1k benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase