Aimv2 Huge Patch14 224
A
Aimv2 Huge Patch14 224
Developed by apple
The AIMv2 series are vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple benchmarks.
Downloads 54
Release Time : 10/29/2024
Model Overview
AIMv2 is an advanced vision model employing multimodal autoregressive pretraining, excelling in image classification and feature extraction tasks.
Model Features
Multimodal Autoregressive Pretraining
Utilizes innovative multimodal autoregressive objectives for pretraining to enhance model performance
Outstanding Benchmark Performance
Outperforms models like CLIP, SigLIP, and DINOv2 on multiple vision benchmarks
Large-scale Scalability
Simple and straightforward pretraining method enables effective training scale expansion
Model Capabilities
Image classification
Image feature extraction
Multimodal understanding
Open-vocabulary object detection
Referring expression comprehension
Use Cases
Computer Vision
Image Classification
High-precision image classification on datasets like ImageNet
Achieves 87.5% accuracy on ImageNet-1k
Fine-grained Classification
Fine-grained image classification for specific domains
Achieves 96.4% accuracy on stanford-cars
Medical Image Analysis
Medical image classification and analysis
Achieves 93.3% accuracy on camelyon17
Multimodal Applications
Open-vocabulary Object Detection
Detects objects in images not explicitly labeled in the training set
Outperforms DINOv2
Referring Expression Comprehension
Understands natural language referring expressions and locates corresponding regions in images
Outperforms DINOv2
Featured Recommended AI Models
ยฉ 2025AIbase