M

Mambavision L2 1K

Developed by nvidia
MambaVision is the first hybrid computer vision model combining the strengths of Mamba and Transformer. It enhances visual feature modeling by reconstructing the Mamba formulation and incorporates self-attention modules in the final layers of the Mamba architecture to improve long-range spatial dependency modeling.
Downloads 56
Release Time : 7/14/2024

Model Overview

MambaVision is a hybrid vision backbone network that combines the advantages of Mamba and Transformer, primarily used for image classification and feature extraction tasks.

Model Features

Hybrid architecture
Combines Mamba's efficient sequence modeling capability with Transformer's long-range dependency modeling to enhance visual feature extraction.
Hierarchical architecture
Adopts a hierarchical architecture design to meet the needs of different computational resources and performance requirements.
High performance
Establishes a new SOTA Pareto frontier in both Top-1 accuracy and computational throughput.

Model Capabilities

Image classification
Feature extraction

Use Cases

Computer vision
Image classification
Classifies input images and outputs category labels.
Achieves high accuracy on the ImageNet-1K dataset.
Feature extraction
Extracts hierarchical image features for downstream tasks such as object detection and image segmentation.
Supports extraction of four-stage hierarchical features and final flattened features after average pooling.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase