M

Mambavision T2 1K

Developed by nvidia
The first hybrid computer vision model combining the strengths of Mamba and Transformer, enhancing visual feature modeling through redesigned Mamba formulations and incorporating self-attention modules in the Mamba architecture to improve long-range spatial dependency modeling.
Downloads 597
Release Time : 7/14/2024

Model Overview

MambaVision is a hybrid visual model combining Mamba and Transformer, designed for image feature extraction and classification, featuring efficient visual feature modeling capabilities and excellent performance.

Model Features

Hybrid Architecture Design
Combines the efficient modeling capability of Mamba with the long-range dependency modeling advantage of Transformer for superior visual feature extraction.
Hierarchical Architecture
Features a hierarchical architecture design to meet diverse visual task requirements, supporting applications of varying scales and complexities.
High Performance
Achieves new SOTA Pareto frontiers in Top-1 accuracy and throughput, demonstrating outstanding performance.

Model Capabilities

Image feature extraction
Image classification

Use Cases

Computer Vision
Image Classification
Classifies input images, such as object recognition in the COCO dataset.
Accurately identifies object categories in images, such as brown bears.
Feature Extraction
Extracts multi-stage image features for downstream tasks like object detection and image segmentation.
Outputs four-stage features and final average pooling features, suitable for various visual tasks.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase