Model Selection

Convolution-Enhanced Transformer

# Convolution-Enhanced Transformer

LeViT-192 is a vision model that combines convolutional neural networks and Transformer architecture, focusing on image classification tasks.

Image Classification

CvT-21 is a vision model combining convolutional and Transformer architectures, pretrained on ImageNet-22k and fine-tuned on ImageNet-1k

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase