M

Mobileclip B OpenCLIP

Developed by apple
MobileCLIP-B is an efficient image-text model that achieves fast inference through multimodal reinforcement training and excels in zero-shot image classification tasks.
Downloads 715
Release Time : 6/7/2024

Model Overview

MobileCLIP is a fast image-text model specifically designed for efficient zero-shot image classification. Through multimodal reinforcement training methods, it achieves performance comparable to larger models while maintaining a compact size.

Model Features

Efficient Performance
Achieves performance comparable to larger models while maintaining a compact size
Fast Inference
Total image + text processing latency of only 13.7ms (MobileCLIP-B)
Multimodal Training
Employs multimodal reinforcement training methods to enhance model performance
Zero-shot Capability
Demonstrates strong zero-shot classification ability on unseen categories

Model Capabilities

Zero-shot image classification
Image-text matching
Multimodal understanding

Use Cases

Computer Vision
Image Classification
Classifies images without specific training
Achieves 76.8% zero-shot accuracy on ImageNet-1k
Image-Text Retrieval
Retrieves relevant images based on text descriptions
Mobile Applications
Mobile Visual Search
Implements efficient visual search functionality on mobile devices
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase