C

CLIP ViT B 32 DataComp.XL S13b B90k

Developed by laion
This is a CLIP ViT-B/32 model trained on the DataComp-1B dataset, designed for tasks like zero-shot image classification and image-text retrieval.
Downloads 12.12k
Release Time : 9/29/2023

Model Overview

The model was trained using the OpenCLIP framework on the DataComp-1B dataset, aiming to provide research outputs for the community to explore zero-shot, arbitrary image classification.

Model Features

Large-scale data training
Trained on 1.4 billion samples from the DataComp-1B dataset, covering a wide range of visual concepts.
Zero-shot capability
Capable of performing image classification and retrieval tasks without task-specific fine-tuning.
Research-friendly
Designed for the research community, supporting interdisciplinary studies and potential impact analysis.

Model Capabilities

Zero-shot image classification
Image-text retrieval
Cross-modal understanding
Image feature extraction

Use Cases

Research applications
Zero-shot image classification research
Explore the model's performance under different classification taxonomies
Achieves 72.7% zero-shot top-1 accuracy on ImageNet-1k
Cross-modal understanding research
Study the associative learning between visual and language modalities
Potential applications
Image search systems
Retrieve relevant images based on text queries
Content moderation assistance
Identify potentially harmful content in images
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase