Mobileclip B LT OpenCLIP
MobileCLIP-B (LT) is an efficient image-text model developed by Apple, achieving fast zero-shot image classification through multimodal reinforcement training, outperforming similar models.
Downloads 774
Release Time : 6/7/2024
Model Overview
MobileCLIP is a fast image-text model specifically designed for zero-shot image classification tasks, delivering efficient performance through optimized architecture and training methods.
Model Features
Efficient performance
Significantly improves speed while maintaining high performance, 2-5 times faster than similar models
Compact size
Model size is 2-3 times smaller than similar ViT-B/16 models
Reinforcement training
Utilizes multimodal reinforcement training with 36B training samples
Zero-shot capability
Optimized for zero-shot image classification tasks without task-specific fine-tuning
Model Capabilities
Zero-shot image classification
Multimodal understanding
Fast inference
Use Cases
Computer vision
Image classification
Classify images without specific training
Achieves 77.2% zero-shot accuracy on ImageNet-1k
Multimodal retrieval
Enable cross-modal image-text retrieval
Mobile applications
Mobile image recognition
Lightweight image recognition suitable for deployment on mobile devices
Low latency (10.4ms for image + 3.3ms for text)
Featured Recommended AI Models
Š 2025AIbase