M

Mambavision T 1K

Developed by nvidia
MambaVision is the first hybrid computer vision model combining the advantages of Mamba and Transformer, significantly enhancing the modeling capability of long-range spatial dependencies through redesigned Mamba formulas and integrated ViT modules.
Downloads 2,323
Release Time : 7/14/2024

Model Overview

MambaVision is a hybrid Mamba-Transformer visual backbone network specifically designed for image classification and feature extraction tasks. It combines the efficient modeling capability of Mamba with the long-range dependency capturing ability of Transformer, achieving new SOTA levels in Top-1 accuracy and throughput.

Model Features

Hybrid Architecture Innovation
First to combine the advantages of Mamba and Transformer, redesigning Mamba formulas to enhance visual feature modeling capabilities
Hierarchical Design
Offers a series of models with hierarchical architectures to meet various design needs
Efficient Long-range Dependency Modeling
Incorporates multiple self-attention modules in the final layer of the Mamba architecture, significantly improving the ability to capture long-range spatial dependencies

Model Capabilities

Image classification
Image feature extraction
Multi-stage feature output

Use Cases

Computer Vision
Image Classification
Classifies and identifies input images, such as recognizing animal species
Successfully identified a brown bear in the example
Feature Extraction
Extracts multi-level feature representations of images for downstream tasks
Can output feature maps from 4 stages and average pooled features
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase