A

Aimv2 1B Patch14 336

Developed by apple
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance in multiple multimodal understanding benchmarks.
Downloads 52
Release Time : 10/29/2024

Model Overview

AIMv2 is a vision model pretrained with multimodal autoregressive objectives, featuring robust image feature extraction and classification capabilities.

Model Features

Multimodal Autoregressive Pretraining
Pretrained with multimodal autoregressive objectives to enhance performance in multimodal understanding tasks.
High Performance
Outperforms OAI CLIP and SigLIP in multiple benchmarks, demonstrating strong recognition capabilities.
Broad Applicability
Delivers excellent performance across various datasets such as ImageNet, CIFAR, and Food101.

Model Capabilities

Image Feature Extraction
Image Classification
Multimodal Understanding

Use Cases

Computer Vision
Image Classification
Classifies images and is applicable to multiple datasets.
Achieves 88.7% accuracy on ImageNet-1k.
Object Detection
Excels in open-vocabulary object detection tasks.
Outperforms the DINOv2 model.
Medical Imaging
Pathological Image Analysis
Used for analyzing medical imaging data.
Achieves 94.2% accuracy on the Camelyon17 dataset.
Featured Recommended AI Models
ยฉ 2025AIbase