Aimv2 Large Patch14 Native Image Classification
A
Aimv2 Large Patch14 Native Image Classification
Developed by amaye15
AIMv2-Large-Patch14-Native is an adapted image classification model, modified from the original AIMv2 model to be compatible with Hugging Face Transformers' AutoModelForImageClassification class.
Downloads 15
Release Time : 11/25/2024
Model Overview
This model is an adapted version of the original AIMv2 model, modified to be compatible with Hugging Face Transformers' AutoModelForImageClassification class for image classification tasks.
Model Features
Multimodal Autoregressive Pre-training
The AIMv2 model is pre-trained with multimodal autoregressive objectives, demonstrating outstanding performance across various benchmarks.
Compatible with Hugging Face Transformers
After adaptation, this model can be directly used with AutoModelForImageClassification, making it easy to integrate into existing workflows.
High Performance
The AIMv2 series outperforms OAI CLIP and SigLIP in most multimodal understanding benchmarks and surpasses DINOv2 in open-vocabulary object detection and referring expression comprehension tasks.
Model Capabilities
Image Classification
Visual Understanding
Use Cases
Computer Vision
General Image Classification
Classify input images to identify the main objects or scenes within them.
Featured Recommended AI Models
Š 2025AIbase