AIMv2 Large Patch14 Native Image Classification

Developed by amaye15
AIMv2-Large-Patch14-Native is an adapted image classification model, modified from the original AIMv2 model to be compatible with Hugging Face Transformers' AutoModelForImageClassification class.
Downloads 15
Release Time: 11/25/2024

Model Overview

This model is an adapted version of the original AIMv2 model, modified to be compatible with Hugging Face Transformers' AutoModelForImageClassification class for image classification tasks.

Model Features

Multimodal Autoregressive Pre-training
The AIMv2 model family is pre-trained with a multimodal autoregressive objective and performs strongly across a range of benchmarks.
Compatible with Hugging Face Transformers
After adaptation, this model can be directly used with AutoModelForImageClassification, making it easy to integrate into existing workflows.
High Performance
The AIMv2 series outperforms OpenAI CLIP and SigLIP on most multimodal understanding benchmarks, and outperforms DINOv2 on open-vocabulary object detection and referring expression comprehension.

Model Capabilities

Image Classification
Visual Understanding

Use Cases

Computer Vision
General Image Classification
Classify input images to identify the main objects or scenes within them.
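
The classification use case can be sketched roughly as follows. This is a minimal illustration, not the author's documented usage: the function names `top_k_labels` and `classify_image` are hypothetical, the repository ID must be supplied by the caller and checked on the Hugging Face Hub, and `trust_remote_code=True` is an assumption that only applies if the repository ships custom modeling code. The source does not say whether the classification head has been fine-tuned, so the returned labels may not be meaningful without further training.

```python
import math


def top_k_labels(logits, id2label, k=5):
    """Turn raw class logits into the k most probable (label, probability) pairs."""
    # Numerically stable softmax: subtract the largest logit before exponentiating.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Rank class indices by probability, highest first.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    # Fall back to the class index as a string when no label name is known.
    return [(id2label.get(i, str(i)), probs[i]) for i in ranked[:k]]


def classify_image(image_path, model_id, k=5, trust_remote_code=True):
    """Classify one image file (requires torch, transformers, and Pillow)."""
    # Imported lazily so top_k_labels stays usable without the heavy dependencies.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForImageClassification

    # Assumption: the checkpoint may need custom code from its repository.
    processor = AutoImageProcessor.from_pretrained(
        model_id, trust_remote_code=trust_remote_code
    )
    model = AutoModelForImageClassification.from_pretrained(
        model_id, trust_remote_code=trust_remote_code
    )
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return top_k_labels(logits, model.config.id2label, k)


# Example call (both arguments are placeholders to replace):
# predictions = classify_image("photo.jpg", "<hub-repo-id>")
# for label, prob in predictions:
#     print(f"{label}: {prob:.3f}")
```

Keeping the softmax and ranking step in a separate pure-Python helper makes it easy to check without downloading the model.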