Resnet101 Clip.openai
R
Resnet101 Clip.openai
Developed by timm
A CLIP model based on ResNet101 architecture, supporting zero-shot image classification tasks.
Downloads 2,717
Release Time : 6/9/2024
Model Overview
This model combines the visual encoding capability of ResNet101 with the multimodal learning ability of CLIP, enabling image classification without fine-tuning.
Model Features
Zero-shot learning
Performs image classification tasks without task-specific fine-tuning
Multimodal understanding
Simultaneously understands visual and textual information for cross-modal matching
ResNet101 backbone
Uses the proven ResNet101 architecture as the visual encoder
Model Capabilities
Image classification
Cross-modal retrieval
Zero-shot learning
Use Cases
Image understanding
Zero-shot image classification
Classifies images using natural language descriptions without category-specific training
Content retrieval
Image-text matching
Retrieves relevant images based on text descriptions or generates descriptions from images
Featured Recommended AI Models