
Deeplabv3 Mobilevit Small

Developed by Apple
A lightweight vision Transformer that combines MobileNetV2-style convolutional blocks with Transformer blocks, suited to semantic segmentation on mobile devices
Downloads 817
Release Time: 5/30/2022

Model Overview

This model adds a DeepLabV3 segmentation head to the MobileViT backbone for semantic segmentation and is pre-trained on the PASCAL VOC dataset.
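A minimal inference sketch follows. It assumes the checkpoint is published on the Hugging Face Hub as `apple/deeplabv3-mobilevit-small` (an assumption inferred from the title and developer listed here) and that `transformers`, `torch`, and `Pillow` are installed. The helper names `logits_to_mask` and `segment`, and any image path passed in, are hypothetical.

```python
def logits_to_mask(logits):
    """Pick the highest-scoring class per pixel.

    `logits` is a nested list shaped [num_classes][height][width].
    Ties resolve to the lowest class index.
    """
    num_classes = len(logits)
    height, width = len(logits[0]), len(logits[0][0])
    return [
        [max(range(num_classes), key=lambda c: logits[c][y][x]) for x in range(width)]
        for y in range(height)
    ]


def segment(image_path, checkpoint="apple/deeplabv3-mobilevit-small"):
    """Run the model on one image and return a per-pixel class-index mask.

    The checkpoint name is an assumption; heavy imports are deferred so
    that `logits_to_mask` can be used without them.
    """
    import torch
    from PIL import Image
    from transformers import (
        MobileViTForSemanticSegmentation,
        MobileViTImageProcessor,
    )

    processor = MobileViTImageProcessor.from_pretrained(checkpoint)
    model = MobileViTForSemanticSegmentation.from_pretrained(checkpoint)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        # Shape (1, num_classes, h, w); h and w are typically smaller than the input.
        logits = model(**inputs).logits
    return logits_to_mask(logits[0].tolist())
```

The returned mask has the resolution of the logits, so it usually needs to be upsampled to the original image size before being overlaid on the image.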

Model Features

Lightweight Design
Combines the lightweight characteristics of MobileNetV2 with the global processing capabilities of Transformers, ideal for mobile deployment
Efficient Segmentation
Uses a DeepLabV3 head for precise semantic segmentation while keeping the model lightweight
Multi-scale Training
Samples training resolutions from 160x160 to 320x320 during pre-training, which helps the model adapt to varying input sizes
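The multi-scale sampling idea can be sketched as below. Only the 160x160 and 320x320 bounds come from this page; the intermediate resolutions, the base batch size, and the rule that scales batch size with resolution are illustrative assumptions.

```python
import random

# Assumed candidate resolutions between the 160x160 and 320x320 bounds.
RESOLUTIONS = [160, 192, 224, 256, 288, 320]


def sample_batch_shape(rng, base_size=320, base_batch=32):
    """Pick a square resolution and a matching batch size.

    The batch size grows as the resolution shrinks so that the number of
    pixels per batch stays roughly constant (an assumed policy).
    """
    size = rng.choice(RESOLUTIONS)
    batch = max(1, base_batch * base_size * base_size // (size * size))
    return size, batch


rng = random.Random(0)
for step in range(3):
    size, batch = sample_batch_shape(rng)
    print(f"step {step}: {batch} images at {size}x{size}")
```

Each training step would then resize its batch to the sampled resolution, so the network sees objects at several scales instead of a single fixed size.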

Model Capabilities

Image Semantic Segmentation
Mobile Image Processing
Real-time Scene Understanding

Use Cases

Computer Vision
Autonomous Driving Scene Understanding
Identifies different object categories in road scenes
Achieves 79.1 mIoU on PASCAL VOC
Mobile Image Editing
Enables real-time background replacement/object segmentation on mobile devices
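For reference, mean intersection-over-union (mIoU), the metric behind the PASCAL VOC score, averages the per-class IoU over classes. The sketch below assumes flat lists of integer class labels and skips classes that appear in neither the prediction nor the ground truth; that is one common convention, and benchmark scripts may handle such classes differently.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over classes present in the prediction or the ground truth."""
    ious = []
    for c in range(num_classes):
        intersection = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union:  # skip classes absent from both
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with two classes and four pixels.
pred = [0, 0, 1, 1]
target = [0, 1, 1, 1]
print(mean_iou(pred, target, num_classes=2))
```

The function returns a fraction between 0 and 1; a score such as 79.1 is that value expressed as a percentage.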