D

Deeplabv3 Mobilevit X Small

Developed by apple
A lightweight vision Transformer model combining MobileNetV2 and Transformer modules, suitable for mobile semantic segmentation tasks.
Downloads 268
Release Time : 5/30/2022

Model Overview

This model is pre-trained on the PASCAL VOC dataset, combining the lightweight architecture of MobileViT with the semantic segmentation capabilities of DeepLabV3, making it ideal for image segmentation applications in resource-constrained environments.

Model Features

Lightweight Design
Combines the lightweight convolutional layers of MobileNetV2 with the global processing capabilities of Transformers, making it suitable for mobile deployment.
Efficient Segmentation
Utilizes the DeepLabV3 head structure to achieve high-quality semantic segmentation while maintaining a lightweight footprint.
No Positional Encoding Required
The MobileViT module can be embedded anywhere in a CNN without requiring additional positional encoding.

Model Capabilities

Image Semantic Segmentation
Mobile Image Processing
Real-time Segmentation Tasks

Use Cases

Computer Vision
Scene Understanding
Performs pixel-level classification of different objects in an image, suitable for applications like autonomous driving and surveillance.
Achieves 77.1 mIOU on the PASCAL VOC dataset
Mobile Image Processing
Enables real-time semantic segmentation on resource-constrained devices.
Only 2.9M parameters, making it suitable for mobile deployment
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase