A

Aimv2 3B Patch14 224

Developed by apple
AIMv2 is a series of vision models pretrained with multimodal autoregressive objectives, achieving outstanding performance in multiple benchmarks
Downloads 57
Release Time : 10/29/2024

Model Overview

AIMv2 is a powerful vision model pretrained with multimodal autoregressive objectives, excelling in image classification and understanding tasks

Model Features

Multimodal Autoregressive Pretraining
Utilizes innovative multimodal autoregressive objectives for pretraining, enhancing model comprehension
Exceptional Classification Performance
Achieves top-tier accuracy on benchmarks like ImageNet
Large-Scale Parameters
A robust 3B-parameter model capable of capturing richer visual features

Model Capabilities

Image feature extraction
Image classification
Multimodal understanding
Open-vocabulary object detection
Referring expression comprehension

Use Cases

Computer Vision
General Image Classification
Performs image classification on standard datasets like ImageNet
ImageNet-1k accuracy 88.5%
Fine-Grained Classification
Application in fine-grained classification tasks such as stanford-cars
stanford-cars accuracy 96.5%
Medical Image Analysis
Application on medical image datasets like camelyon17
camelyon17 accuracy 93.5%
Featured Recommended AI Models
ยฉ 2025AIbase