
MambaVision B 21K

Developed by NVIDIA
The first hybrid computer vision model to combine the strengths of Mamba and Transformer. It redesigns the Mamba formulation to model visual features more efficiently and adds self-attention blocks in the final layers of the Mamba architecture to better capture long-range spatial dependencies.
Downloads 1,395
Release Time: 3/24/2025

Model Overview

MambaVision is a hierarchical visual backbone network that combines the advantages of Mamba and Transformer, suitable for image classification and feature extraction tasks.

Model Features

Hybrid Architecture Innovation
The first model to combine Mamba and Transformer, with a redesigned Mamba formulation that models visual features more efficiently
Hierarchical Structure Design
Offered as a family of hierarchical models to meet diverse design requirements
Performance Optimization
Adds self-attention blocks in the final layers of the Mamba architecture, significantly improving the modeling of long-range spatial dependencies
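The hybrid layout described above, with self-attention placed in the final layers, can be illustrated with a minimal sketch. The function name `layer_layout` and the even split between Mamba-style and self-attention layers are assumptions made for illustration; this listing does not state the exact ratio or the stages in which it applies.

```python
def layer_layout(depth: int) -> list[str]:
    """Return an illustrative per-layer mixer type for one hybrid stage."""
    # Assumption: the last half of the layers use self-attention.
    n_attention = depth // 2
    # The remaining (earlier) layers use the Mamba-style mixer.
    n_mamba = depth - n_attention
    return ["mamba"] * n_mamba + ["attention"] * n_attention


# Print the layout for a hypothetical stage with 8 layers.
print(layer_layout(8))
```

Placing attention last means the Mamba-style layers process the sequence first, and the attention layers then mix information across all spatial positions.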

Model Capabilities

Image Classification
Visual Feature Extraction

Use Cases

Computer Vision
Image Classification
Classify input images
Achieves 84.9% Top-1 accuracy on ImageNet-1K
Feature Extraction
Extract feature maps from each of the four stages, plus globally average-pooled features, for an input image
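To make the feature-extraction output concrete, here is a minimal NumPy sketch of four-stage feature maps and global average pooling. It does not load the model: the feature maps are random placeholders, and the 224x224 input size, the stage strides (4, 8, 16, 32), and the channel widths are assumptions that this listing does not state.

```python
import numpy as np

# Assumptions (not given in the listing): input resolution, per-stage
# downsampling strides, and hypothetical channel widths.
IMAGE_SIZE = 224
STRIDES = (4, 8, 16, 32)
CHANNELS = (128, 256, 512, 1024)


def fake_stage_features(batch: int = 1, seed: int = 0) -> list[np.ndarray]:
    """Create random stand-ins for the four stage feature maps (N, C, H, W)."""
    rng = np.random.default_rng(seed)
    return [
        rng.standard_normal(
            (batch, c, IMAGE_SIZE // s, IMAGE_SIZE // s)
        ).astype(np.float32)
        for c, s in zip(CHANNELS, STRIDES)
    ]


def global_average_pool(feature_map: np.ndarray) -> np.ndarray:
    """Average over the spatial axes, turning (N, C, H, W) into (N, C)."""
    return feature_map.mean(axis=(2, 3))


features = fake_stage_features()
pooled = global_average_pool(features[-1])

for i, f in enumerate(features, start=1):
    print(f"stage {i}: {f.shape}")
print("pooled:", pooled.shape)
```

The pooled vector is the kind of compact image descriptor typically passed to a classifier head, while the per-stage maps are what dense-prediction heads usually consume.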