C

CLIP ViT B 16 DataComp.XL S13b B90k

Developed by flavour
This is a CLIP ViT-L/14 model trained on the DataComp-1B dataset, supporting zero-shot image classification and image-text retrieval tasks.
Downloads 39.22k
Release Time : 7/27/2023

Model Overview

The model was trained using the OpenCLIP framework on the DataComp-1B dataset, primarily for research on zero-shot image classification and cross-modal retrieval tasks.

Model Features

Large-scale training data
Trained on 1.4 billion samples from the DataComp-1B dataset
Zero-shot capability
Can perform various image classification tasks without fine-tuning
Cross-modal understanding
Capable of understanding relationships between images and text

Model Capabilities

Zero-shot image classification
Image-text retrieval
Cross-modal understanding

Use Cases

Research
Zero-shot image classification research
Explore model performance under different classification systems
Achieves 79.2% zero-shot top-1 accuracy on ImageNet-1k
Content management
Image retrieval
Retrieve relevant images based on text descriptions
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase