Levit-128 Open-Source Image Classification Model - Combining the Advantages of Convolution for Efficient Image Inference

Levit 128

Developed by facebook

LeViT-128 is an image classification model based on the Vision Transformer architecture, achieving efficient inference by combining the advantages of convolutional networks.

Image Classification

Transformers

Open Source License:Apache-2.0 #Efficient Vision Transformer #Fast Image Classification #ImageNet Pretrained

Downloads 44

Release Time : 6/1/2022

Model Overview

The LeViT-128 model is pretrained on the ImageNet-1k dataset at 224x224 resolution and can classify images into 1,000 categories.

Model Features

Efficient Inference

Achieves faster inference speed than traditional Vision Transformers by leveraging the advantages of convolutional networks.

Hybrid Architecture

Innovatively combines Transformer and convolutional networks, incorporating the strengths of both.

Model Capabilities

Image Classification

Visual Feature Extraction

Use Cases

Computer Vision

Object Recognition

Identify object categories in images

Can accurately classify 1,000 categories from ImageNet

Visual Content Analysis

Analyze image content and extract features

Property	Details
Model Type	Vision Transformer for Image Classification
Training Data	ImageNet-1k

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Levit 128

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 LeViT

🚀 Quick Start

💻 Usage Examples

Basic Usage

📄 License