C

CLIP Convnext Large D.laion2b S26b B102k Augreg

Developed by laion
Large-scale ConvNeXt-Large CLIP model trained on LAION-2B dataset, supporting zero-shot image classification and image-text retrieval tasks
Downloads 80.74k
Release Time : 1/29/2023

Model Overview

This model adopts ConvNeXt-Large architecture with enhanced data augmentation and regularization techniques, achieving 75.9% accuracy on ImageNet zero-shot classification

Model Features

ConvNeXt Architecture Innovation
Among the first large-scale trained ConvNeXt CLIP models, significantly improving computational efficiency compared to traditional ViT architectures
Enhanced Data Augmentation
Employs advanced regularization techniques like Random Resized Crop (RRC), Random Erasing (RE), and Stochastic Depth (SD) to enhance model robustness
Efficient Training
Achieves superior performance with only half the computational cost of ViT-L/16 at 256 resolution

Model Capabilities

Zero-shot image classification
Image-text similarity calculation
Cross-modal retrieval

Use Cases

Image Understanding
Zero-shot Image Classification
Classify new images without fine-tuning
75.9% zero-shot accuracy on ImageNet-1k
Cross-modal Retrieval
Image-Text Retrieval
Search relevant images based on text or generate descriptions from images
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase