DeepLabV3 MobileNet V2 1.0 513 Open-Source Semantic Segmentation Model - Accurately Identify Image Semantic Scenes

Deeplabv3 Mobilenet V2 1.0 513

Developed by Matthijs

A semantic segmentation model based on MobileNetV2 architecture with DeepLabV3+ head, pre-trained on the PASCAL VOC dataset

Image Segmentation

Transformers

Open Source License:Other #Mobile Image Segmentation #Low-Power Model #Real-Time Semantic Segmentation

Downloads 164

Release Time : 6/28/2022

Model Overview

This is a lightweight semantic segmentation model that combines the efficiency of MobileNetV2 with the precise segmentation capabilities of DeepLabV3+, suitable for mobile devices and resource-constrained environments.

Model Features

Lightweight and Efficient

Based on MobileNetV2 architecture, optimized for mobile devices with low latency and low power consumption

Precise Segmentation

Incorporates DeepLabV3+ head to deliver high-quality semantic segmentation results

Pre-trained Model

Pre-trained on the PASCAL VOC dataset at 513x513 resolution, ready for immediate use

Model Capabilities

Image Semantic Segmentation

Object Boundary Recognition

Scene Understanding

Use Cases

Computer Vision

Autonomous Driving Scene Segmentation

Used to identify key elements such as roads, pedestrians, and vehicles

Medical Image Analysis

Can be used for organ or lesion segmentation in medical images

🚀 MobileNetV2 with DeepLabV3+

A MobileNet V2 model pre - trained on PASCAL VOC at 513x513 resolution for image segmentation tasks.

🚀 Quick Start

You can use the raw model for semantic segmentation. Check out the model hub to find fine - tuned versions for tasks that interest you.

✨ Features

The MobileNet V2 model is small, has low latency, and low power consumption, suitable for running on mobile devices.
It can be used for various tasks such as classification, detection, embeddings, and segmentation.
This repo adds a DeepLabV3+ head to the MobileNetV2 backbone for semantic segmentation.

📚 Documentation

Model description

From the original README:

MobileNets are small, low - latency, low - power models parameterized to meet the resource constraints of a variety of use cases. They can be built upon for classification, detection, embeddings and segmentation similar to how other popular large scale models, such as Inception, are used. MobileNets can be run efficiently on mobile devices [...] MobileNets trade off between latency, size and accuracy while comparing favorably with popular models from the literature.

The model in this repo adds a DeepLabV3+ head to the MobileNetV2 backbone for semantic segmentation.

Intended uses & limitations

You can use the raw model for semantic segmentation. See the model hub to look for fine - tuned versions on a task that interests you.

💻 Usage Examples

Basic Usage

from transformers import MobileNetV2FeatureExtractor, MobileNetV2ForSemanticSegmentation
from PIL import Image
import requests

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

feature_extractor = MobileNetV2FeatureExtractor.from_pretrained("Matthijs/deeplabv3_mobilenet_v2_1.0_513")
model = MobileNetV2ForSemanticSegmentation.from_pretrained("Matthijs/deeplabv3_mobilenet_v2_1.0_513")

inputs = feature_extractor(images=image, return_tensors="pt")

outputs = model(**inputs)
logits = outputs.logits
predicted_mask = logits.argmax(1).squeeze(0)

Currently, both the feature extractor and model support PyTorch.

📄 License

License: other

BibTeX entry and citation info

@inproceedings{deeplabv3plus2018,
  title={Encoder - Decoder with Atrous Separable Convolution for Semantic Image Segmentation},
  author={Liang - Chieh Chen and Yukun Zhu and George Papandreou and Florian Schroff and Hartwig Adam},
  booktitle={ECCV},
  year={2018}
}

Information Table

Property	Details
Model Type	MobileNetV2 with DeepLabV3+ for semantic segmentation
Training Data	PASCAL VOC
Tags	vision, image - segmentation

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご