M

Mambavision L3 512 21K

Developed by nvidia
MambaVision is the first hybrid computer vision model combining the strengths of Mamba and Transformer. It enhances visual feature modeling by redesigning the Mamba formulation and incorporates self-attention modules in the final layers of the Mamba architecture to improve long-range spatial dependency modeling.
Downloads 7,548
Release Time : 3/24/2025

Model Overview

The MambaVision series models are hybrid architectures specifically designed for computer vision tasks, combining Mamba's efficient sequence modeling capabilities with Transformer's self-attention mechanism, suitable for tasks like image classification and feature extraction.

Model Features

Hybrid Architecture Design
Combines Mamba's efficient sequence modeling capabilities with Transformer's self-attention mechanism, incorporating self-attention modules in the final layers of the Mamba architecture to enhance long-range spatial dependency modeling.
Hierarchical Structure
Offers a series of models with hierarchical structures, including models of different scales to meet varying computational resource and performance requirements.
High Performance
Achieves a new SOTA Pareto frontier in Top1 accuracy and throughput, balancing model performance and computational efficiency.

Model Capabilities

Image Classification
Feature Extraction

Use Cases

Computer Vision
Image Classification
Use MambaVision for image classification tasks, such as identifying animal species or object categories.
Achieves 88.1% Top1 accuracy on ImageNet-1K.
Feature Extraction
Extracts four-stage feature maps and global pooling features from images, which can be used for downstream tasks like object detection and image segmentation.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase