Swin Large Patch4 Window7 224 In22k

Developed by Microsoft
Swin Transformer is a hierarchical vision transformer based on shifted windows, pretrained on the ImageNet-21k dataset, suitable for image classification tasks.
Downloads: 387
Release Time: 3/2/2022

Model Overview

This model constructs hierarchical feature maps by merging image patches in deeper layers and computes self-attention only within local windows, achieving linear computational complexity relative to input image size.
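The linear-versus-quadratic claim can be illustrated with a rough cost count. The sketch below uses the complexity estimates given in the Swin Transformer paper (softmax cost omitted); the function names are hypothetical, and the channel width of 192 and the 56x56 token grid are assumptions based on the Swin-L configuration at 224x224 input with 4x4 patches.

```python
def global_attention_cost(h, w, c):
    """Approximate multiply-adds for global self-attention over h*w tokens of width c."""
    tokens = h * w
    # Projections are linear in tokens; the attention matrix is quadratic in tokens.
    return 4 * tokens * c**2 + 2 * tokens**2 * c


def window_attention_cost(h, w, c, m=7):
    """Approximate multiply-adds when attention is restricted to m x m windows."""
    tokens = h * w
    # Each token attends to only m*m neighbours, so both terms are linear in tokens.
    return 4 * tokens * c**2 + 2 * m**2 * tokens * c


if __name__ == "__main__":
    # Assumed values: c=192 (Swin-L first stage), window size 7.
    for side in (56, 112, 224):
        g = global_attention_cost(side, side, 192)
        l = window_attention_cost(side, side, 192)
        print(f"{side}x{side} tokens: global={g:,} windowed={l:,}")
```

Doubling the side of the token grid multiplies the windowed cost by exactly four (the growth in token count), while the global cost grows faster because of its quadratic term.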

Model Features

Hierarchical Feature Maps
Builds hierarchical feature maps by merging image patches in deeper layers, which makes the model suitable for processing visual information at different scales
Local Window Attention
Computes self-attention only within local windows, resulting in linear computational complexity relative to input image size
General Backbone Network
Can serve as a general backbone network for image classification and dense recognition tasks
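As a rough illustration of the hierarchy described above, the following sketch computes the feature-map shape at each stage. It assumes the Swin-L settings implied by the model name (224x224 input, 4x4 patches) plus an initial embedding width of 192 and four stages, with each patch-merging step halving the spatial resolution and doubling the channel count. The function name `stage_shapes` is hypothetical and is not part of any library.

```python
def stage_shapes(image_size=224, patch_size=4, embed_dim=192, num_stages=4):
    """Return (height, width, channels) of the feature map at each stage."""
    side = image_size // patch_size  # tokens per side after patch embedding
    dim = embed_dim
    shapes = []
    for _ in range(num_stages):
        shapes.append((side, side, dim))
        # Patch merging between stages: 2x2 neighbouring tokens are merged,
        # halving each spatial side and doubling the channel width.
        side //= 2
        dim *= 2
    return shapes


if __name__ == "__main__":
    for stage, shape in enumerate(stage_shapes(), start=1):
        print(f"stage {stage}: {shape}")
```

Under these assumptions the final 7x7 map coincides with a single 7x7 attention window, and the intermediate maps are what a dense recognition head would typically consume.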

Model Capabilities

Image Classification
Visual Feature Extraction

Use Cases

Computer Vision
ImageNet Image Classification
Classifies images into one of 21,841 ImageNet-21k categories
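A minimal classification sketch using the Hugging Face transformers library follows. The Hub identifier `microsoft/swin-large-patch4-window7-224-in22k` is an assumption inferred from the model name and developer listed on this page, and the helper names `top_k` and `classify` are hypothetical. The heavy dependencies (torch, Pillow, transformers) are imported inside the function so the ranking helper can be used on its own.

```python
def top_k(scores, k=5):
    """Return (index, score) pairs for the k highest scores, best first."""
    ranked = sorted(enumerate(scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


def classify(image_path, model_id="microsoft/swin-large-patch4-window7-224-in22k", k=5):
    """Return the k most likely (label, probability) pairs for one image file."""
    # Imported lazily; model_id is an assumed Hub identifier, not confirmed by this page.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForImageClassification

    processor = AutoImageProcessor.from_pretrained(model_id)
    model = AutoModelForImageClassification.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, num_labels)

    probs = logits.softmax(dim=-1)[0].tolist()
    return [(model.config.id2label[i], p) for i, p in top_k(probs, k)]
```

Calling `classify("photo.jpg")` (with a placeholder file name) would download the weights on first use and return label and probability pairs drawn from the model's own label map.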