Mobileclip B OpenCLIP
MobileCLIP-B is an efficient image-text model that achieves fast inference through multimodal reinforcement training and excels in zero-shot image classification tasks.
Downloads 715
Release Time : 6/7/2024
Model Overview
MobileCLIP is a fast image-text model specifically designed for efficient zero-shot image classification. Through multimodal reinforcement training methods, it achieves performance comparable to larger models while maintaining a compact size.
Model Features
Efficient Performance
Achieves performance comparable to larger models while maintaining a compact size
Fast Inference
Total image + text processing latency of only 13.7ms (MobileCLIP-B)
Multimodal Training
Employs multimodal reinforcement training methods to enhance model performance
Zero-shot Capability
Demonstrates strong zero-shot classification ability on unseen categories
Model Capabilities
Zero-shot image classification
Image-text matching
Multimodal understanding
Use Cases
Computer Vision
Image Classification
Classifies images without specific training
Achieves 76.8% zero-shot accuracy on ImageNet-1k
Image-Text Retrieval
Retrieves relevant images based on text descriptions
Mobile Applications
Mobile Visual Search
Implements efficient visual search functionality on mobile devices
Featured Recommended AI Models