V

Vit Bigg 14 CLIPA 336 Datacomp1b

Developed by UCSC-VLAA
CLIPA-v2 model, an efficient contrastive image-text model, focused on zero-shot image classification tasks
Downloads 259
Release Time : 10/17/2023

Model Overview

This model is a contrastive image-text model based on the CLIPA-v2 architecture, capable of performing image classification in a zero-shot setting without task-specific fine-tuning

Model Features

Efficient zero-shot learning
Capable of accurate image classification without task-specific fine-tuning
Low-cost high-performance
Achieves 81.1% zero-shot ImageNet accuracy with a relatively low budget
Inverse scaling law
Adopts innovative training methods following the inverse scaling law to improve training efficiency

Model Capabilities

Zero-shot image classification
Image-text matching
Multimodal feature extraction

Use Cases

Image classification
Zero-shot image recognition
Classify images without training
Achieves 81.1% zero-shot accuracy on ImageNet
Content retrieval
Image-text matching
Retrieve relevant images based on text descriptions
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase