M

Mobilevit X Small

Developed by apple
MobileViT is a lightweight, low-latency vision Transformer model that combines the advantages of CNNs and Transformers, making it suitable for mobile devices.
Downloads 1,062
Release Time : 5/30/2022

Model Overview

This model is pre-trained on the ImageNet-1k dataset for image classification tasks, featuring lightweight and efficient characteristics.

Model Features

Lightweight Design
Optimized for mobile devices with only 2.3M parameters, suitable for deployment in resource-constrained environments.
Hybrid Architecture
Combines MobileNetV2's CNN layers with Transformer modules, enabling both local and global feature processing.
Multi-scale Training
Uses multi-scale sampling (160x160 to 320x320) during training to enhance adaptability to images of different resolutions.

Model Capabilities

Image Classification
Visual Feature Extraction

Use Cases

Computer Vision
Object Recognition
Identifies object categories in images (e.g., animals, everyday items).
Achieves 74.8% top-1 accuracy on ImageNet-1k.
Mobile Vision Applications
Suitable for real-time image classification scenarios on mobile devices like smartphones.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase