MobileNet_V2_1.4_224 Open-source Image Classification Model - Lightweight Design for Mobile Devices

Mobilenet V2 1.4 224

Developed by google

A lightweight image classification model pre-trained on the ImageNet-1k dataset, specifically optimized for mobile devices

Image Classification

Transformers

Open Source License:Other #Lightweight Image Classification #Mobile Optimization #Low-Latency Inference

Downloads 737

Release Time : 11/10/2022

Model Overview

MobileNet V2 is a lightweight convolutional neural network designed for mobile and embedded vision applications. It achieves a good balance between latency, size, and accuracy, making it suitable for tasks like image classification.

Model Features

Lightweight Design

Optimized for mobile and embedded devices with low latency and low power consumption

Inverted Residual Structure

Utilizes innovative inverted residual and linear bottleneck structures to improve model efficiency

Configurable Parameters

Depth multiplier and resolution can be adjusted for different application scenarios

Model Capabilities

Image Classification

Feature Extraction

Use Cases

Computer Vision

Mobile Device Image Classification

Real-time image classification on mobile devices such as smartphones

Embedded Vision Systems

Deploy visual recognition functions on resource-constrained embedded devices

🚀 MobileNet V2

The MobileNet V2 model is pre - trained on ImageNet - 1k at a resolution of 224x224. It offers efficient image classification capabilities suitable for various resource - constrained scenarios.

🚀 Quick Start

You can use the raw model for image classification. Check out the model hub to find fine - tuned versions for tasks that interest you.

✨ Features

Lightweight and Efficient: MobileNets are small, low - latency, low - power models designed to meet the resource constraints of various use cases.
Versatile Applications: Can be used for classification, detection, embeddings, and segmentation, similar to other large - scale models.
Trade - off Optimization: Balances latency, size, and accuracy, comparing favorably with popular models from the literature.

📦 Installation

No specific installation steps are provided in the original document.

💻 Usage Examples

Basic Usage

Here is how to use this model to classify an image of the COCO 2017 dataset into one of the 1,000 ImageNet classes:

from transformers import AutoImageProcessor, AutoModelForImageClassification
from PIL import Image
import requests

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

preprocessor = AutoImageProcessor.from_pretrained("google/mobilenet_v2_1.4_224")
model = AutoModelForImageClassification.from_pretrained("google/mobilenet_v2_1.4_224")

inputs = preprocessor(images=image, return_tensors="pt")

outputs = model(**inputs)
logits = outputs.logits

# model predicts one of the 1000 ImageNet classes
predicted_class_idx = logits.argmax(-1).item()
print("Predicted class:", model.config.id2label[predicted_class_idx])

Note: This model actually predicts 1001 classes, the 1000 classes from ImageNet plus an extra “background” class (index 0). Currently, both the feature extractor and model support PyTorch.

📚 Documentation

Model description

From the original README:

MobileNets are small, low - latency, low - power models parameterized to meet the resource constraints of a variety of use cases. They can be built upon for classification, detection, embeddings and segmentation similar to how other popular large scale models, such as Inception, are used. MobileNets can be run efficiently on mobile devices [...] MobileNets trade off between latency, size and accuracy while comparing favorably with popular models from the literature.

The checkpoints are named mobilenet_v2_depth_size, for example mobilenet_v2_1.4_224, where 1.4 is the depth multiplier and 224 is the resolution of the input images the model was trained on.

Intended uses & limitations

You can use the raw model for image classification. See the model hub to look for fine - tuned versions on a task that interests you.

BibTeX entry and citation info

@inproceedings{mobilenetv22018,
  title={MobileNetV2: Inverted Residuals and Linear Bottlenecks},
  author={Mark Sandler and Andrew Howard and Menglong Zhu and Andrey Zhmoginov and Liang - Chieh Chen},
  booktitle={CVPR},
  year={2018}
}

📄 License

The license for this model is "other".

Property	Details
Model Type	MobileNet V2
Training Data	ImageNet - 1k
Tags	vision, image - classification

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご