A

Aimv2 3B Patch14 448

Developed by apple
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple visual understanding benchmarks.
Downloads 161
Release Time : 10/29/2024

Model Overview

The AIMv2 series of vision models are pretrained with multimodal autoregressive objectives, featuring robust image feature extraction and classification capabilities, outperforming peer models in multiple benchmarks.

Model Features

Multimodal Autoregressive Pretraining
Pretrained with multimodal autoregressive objectives, effectively enhancing model performance.
Outstanding Classification Performance
Outperforms models like OpenAI CLIP, SigLIP, and DINOv2 in multiple benchmarks.
Large-Scale Parameters
With 3B parameters, the model possesses powerful feature extraction capabilities.

Model Capabilities

Image feature extraction
Image classification
Multimodal understanding

Use Cases

Computer Vision
Image Classification
High-precision image classification on datasets like ImageNet.
ImageNet-1k accuracy 89.5%
Fine-Grained Classification
Excellent performance in fine-grained classification tasks like stanford-cars.
stanford-cars accuracy 96.7%
Medical Imaging
Pathological Image Analysis
Classification on medical imaging datasets like camelyon17.
camelyon17 accuracy 93.4%
Featured Recommended AI Models
ยฉ 2025AIbase