M

Mambavision S 1K

Developed by nvidia
The first hybrid computer vision model combining the advantages of Mamba and Transformer, enhancing visual feature modeling efficiency by reconstructing the Mamba formula, and improving long-range spatial dependency modeling by adding a self-attention module at the end of the Mamba architecture.
Downloads 908
Release Time : 7/14/2024

Model Overview

MambaVision is a visual backbone network that combines the strengths of Mamba and Transformer, primarily used for image classification and feature extraction tasks, featuring efficient visual feature modeling and long-range spatial dependency processing capabilities.

Model Features

Hybrid Architecture
Combines the advantages of Mamba and Transformer, reconstructing the Mamba formula to enhance visual feature modeling efficiency.
Long-range Spatial Dependency Modeling
Adds a self-attention module at the end of the Mamba architecture, significantly improving long-range spatial dependency modeling.
Hierarchical Architecture
Offers the MambaVision series models with a hierarchical architecture to meet diverse design needs.
High Performance
Achieves a new SOTA Pareto frontier in Top-1 accuracy and throughput.

Model Capabilities

Image Classification
Feature Extraction
Multi-stage Feature Output

Use Cases

Computer Vision
Image Classification
Use MambaVision for image classification, such as identifying animal species.
Predicted class: Brown bear (brown bear, bruin, Ursus arctos)
Feature Extraction
Use MambaVision as a general feature extractor to obtain four-stage hierarchical features and final average pooling features.
Can obtain feature maps from four stages and average pooling features
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase