LeViT-128S Open-Source Vision Model: Combining the Advantages of Convolution for Fast Image Inference

LeViT-128S

Developed by Facebook

LeViT-128S is a vision Transformer model pretrained on the ImageNet-1k dataset. It borrows design elements from convolutional networks to achieve faster inference.

Image Classification

Transformers

Open Source License: Apache-2.0
#Efficient Image Classification #Lightweight Transformer #Fast Inference

Downloads 3,198

Release Time: 6/1/2022

Model Overview

LeViT is a vision model for image classification that integrates convolutional and Transformer components. It is designed to optimize inference speed while maintaining high accuracy.

Model Features

Hybrid Architecture Design

Combines the strengths of convolutional networks and Transformers to optimize computational efficiency while maintaining performance on vision tasks.

Efficient Inference

Designed for fast inference, with lower computational overhead compared to pure Transformer architectures.

ImageNet Pretraining

Pretrained on the ImageNet-1k dataset, ready for direct use in thousand-class image classification tasks.
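As a rough illustration of direct use for thousand-class classification, the following is a minimal sketch rather than the official usage example. It assumes the checkpoint is published on the Hugging Face Hub under the id facebook/levit-128S and that the transformers, torch, and Pillow packages are installed. The helper names top_k and classify are hypothetical and introduced here only for illustration.

```python
import math

# Assumption: Hugging Face Hub id for this checkpoint.
MODEL_ID = "facebook/levit-128S"


def top_k(logits, id2label, k=5):
    """Convert raw class scores into the k most probable (label, probability) pairs."""
    peak = max(logits)
    # Subtract the maximum before exponentiating for numerical stability.
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    return [(id2label[i], probs[i]) for i in ranked]


def classify(image_path, k=5):
    """Classify one image file and return its top-k ImageNet-1k labels."""
    # Heavy imports stay local so top_k remains usable without them.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForImageClassification

    processor = AutoImageProcessor.from_pretrained(MODEL_ID)
    model = AutoModelForImageClassification.from_pretrained(MODEL_ID)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0]
    return top_k(logits.tolist(), model.config.id2label, k)
```

Calling classify("photo.jpg") with a local image path of your own would download the weights on first use and return a list of (label, probability) pairs.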

Model Capabilities

Image Classification

Visual Feature Extraction
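For visual feature extraction, one possible approach is to load the backbone without its classification head and use the pooled output as an image embedding. The sketch below is an assumption-laden illustration, not a documented recipe: it assumes the Hub id facebook/levit-128S, assumes the backbone exposes a pooler_output field, and uses the hypothetical helper names embed and cosine_similarity.

```python
import math

# Assumption: Hugging Face Hub id for this checkpoint.
MODEL_ID = "facebook/levit-128S"


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def embed(image_path):
    """Return a pooled feature vector for one image file as a list of floats."""
    # Heavy imports stay local so cosine_similarity is usable without them.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModel

    processor = AutoImageProcessor.from_pretrained(MODEL_ID)
    # AutoModel loads the backbone only; classifier weights in the checkpoint are ignored.
    model = AutoModel.from_pretrained(MODEL_ID)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # Assumption: pooler_output holds the averaged final-stage features.
    return outputs.pooler_output[0].tolist()
```

Two images can then be compared with cosine_similarity(embed(path_a), embed(path_b)), where values closer to 1 suggest more similar visual content.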

Use Cases

Computer Vision

General Object Recognition

Identify common objects in images (e.g., animals, everyday items)

Classifies images into the 1,000 categories of ImageNet-1k

Scene Understanding

Analyze image scene content (e.g., indoor/outdoor environments, building types)

Property	Details
Model Type	LeViT-128S for image classification
Training Data	ImageNet-1k
