ConvNeXT-tiny-224 Open-Source Image Recognition Model - Powerful Performance on Par with Transformer!

Convnext Tiny 224

Developed by facebook

ConvNeXT is a pure convolutional model inspired by Vision Transformer designs and trained on the ImageNet-1k dataset; its authors report that it outperforms comparable Transformers.

Image Classification

Transformers

Open Source License: Apache-2.0 #Image Classification #Convolution Optimization #Lightweight

Downloads 18.67k

Release Time: 3/2/2022

Model Overview

ConvNeXT is a modern convolutional neural network designed for image classification tasks, delivering excellent performance on the ImageNet-1k dataset.

Model Features

Pure Convolutional Architecture

Uses only convolutions, with no self-attention layers, avoiding the cost of attention in Transformers.

Modern Design

Builds upon ResNet while incorporating modern improvements inspired by Swin Transformer design principles.

High Performance

Reported by its authors to outperform comparable Transformer models on the ImageNet-1k dataset.

Model Capabilities

Image Classification

Visual Feature Extraction
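
The feature-extraction capability can be sketched with the headless `ConvNextModel` class from the transformers library. This is a minimal sketch under stated assumptions, not the authors' recipe: the model is built from a `ConvNextConfig` with randomly initialized weights so the snippet runs offline, the widths and depths are assumed to match the tiny layout, and the input is a random tensor standing in for a preprocessed image.

```python
import torch
from transformers import ConvNextConfig, ConvNextModel

# Assumption: these widths and depths mirror the tiny variant.
# Weights here are random; in practice you would load trained ones with
# ConvNextModel.from_pretrained("facebook/convnext-tiny-224").
config = ConvNextConfig(hidden_sizes=[96, 192, 384, 768], depths=[3, 3, 9, 3])
backbone = ConvNextModel(config).eval()

# Stand-in for a preprocessed 224x224 RGB image batch (hypothetical input).
pixel_values = torch.randn(1, 3, 224, 224)

with torch.no_grad():
    outputs = backbone(pixel_values=pixel_values)

feature_map = outputs.last_hidden_state  # spatial features, channels first
pooled = outputs.pooler_output           # one vector per image

print(feature_map.shape, pooled.shape)
```

The pooled vector is the usual choice for retrieval or a linear probe, while the spatial feature map suits tasks that need location information.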

Use Cases

Computer Vision

Object Recognition

Identify object categories in images

Excellent performance on the ImageNet-1k dataset

Image Classification System

Build automated image classification systems

🚀 ConvNeXT (tiny-sized model)

ConvNeXT model trained on ImageNet-1k at resolution 224x224, offering high-performance image classification capabilities.

🚀 Quick Start

The ConvNeXT model is designed for image classification. You can use the raw model directly for this task. You can also explore fine-tuned versions on the model hub according to your specific needs.

✨ Features

Innovative Design: Inspired by Vision Transformers, ConvNeXT modernizes the ResNet design, aiming to outperform traditional convolutional models.
Image Classification: Capable of classifying images into one of the 1,000 ImageNet classes.

📚 Documentation

Model description

ConvNeXT is a pure convolutional model (ConvNet). It takes inspiration from the design of Vision Transformers and claims to outperform them. The authors started with a ResNet and "modernized" its design, drawing inspiration from the Swin Transformer.
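
To make the "modernized ResNet" idea concrete, here is a simplified sketch of a ConvNeXt-style residual block in PyTorch. It is an illustration under assumptions rather than the released implementation: the class name `ConvNeXtBlockSketch` is hypothetical, and layer scale and stochastic depth are left out for brevity. It shows a large-kernel depthwise convolution followed by LayerNorm and an inverted-bottleneck MLP with GELU.

```python
import torch
from torch import nn


class ConvNeXtBlockSketch(nn.Module):
    """Simplified ConvNeXt-style block (hypothetical name, no layer scale or drop path)."""

    def __init__(self, dim: int, expansion: int = 4):
        super().__init__()
        # 7x7 depthwise convolution: groups=dim gives one filter per channel.
        self.dwconv = nn.Conv2d(dim, dim, kernel_size=7, padding=3, groups=dim)
        # LayerNorm over the channel dimension (applied in channels-last layout).
        self.norm = nn.LayerNorm(dim, eps=1e-6)
        # Pointwise (1x1) convolutions written as Linear layers: expand, then project back.
        self.pwconv1 = nn.Linear(dim, expansion * dim)
        self.act = nn.GELU()
        self.pwconv2 = nn.Linear(expansion * dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        residual = x
        x = self.dwconv(x)
        x = x.permute(0, 2, 3, 1)  # (N, C, H, W) -> (N, H, W, C)
        x = self.norm(x)
        x = self.pwconv2(self.act(self.pwconv1(x)))
        x = x.permute(0, 3, 1, 2)  # back to (N, C, H, W)
        return residual + x


# Assumed example sizes: 96 channels on a 56x56 feature map.
block = ConvNeXtBlockSketch(dim=96)
out = block(torch.randn(1, 96, 56, 56))
print(out.shape)
```

The block preserves the input shape, so it can be stacked within a stage much like a ResNet block.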


Intended uses & limitations

You can use the raw model for image classification. To find fine-tuned versions for tasks that interest you, check the model hub.

How to use

Here is a code example demonstrating how to use this model to classify an image from the COCO 2017 dataset into one of the 1,000 ImageNet classes:

from transformers import ConvNextImageProcessor, ConvNextForImageClassification
import torch
from datasets import load_dataset

# load a sample image from the Hugging Face hub
dataset = load_dataset("huggingface/cats-image")
image = dataset["test"]["image"][0]

# load the preprocessor and the pretrained classifier
processor = ConvNextImageProcessor.from_pretrained("facebook/convnext-tiny-224")
model = ConvNextForImageClassification.from_pretrained("facebook/convnext-tiny-224")

# resize and normalize the image, returning PyTorch tensors
inputs = processor(image, return_tensors="pt")

# inference only, so gradients are not needed
with torch.no_grad():
    logits = model(**inputs).logits

# model predicts one of the 1000 ImageNet classes
predicted_label = logits.argmax(-1).item()
print(model.config.id2label[predicted_label])

For more code examples, refer to the documentation.
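
As one further sketch, the logits returned above can be turned into ranked class probabilities instead of a single label. The helper name `top_k_predictions` is hypothetical, and the four-class logits and labels below are made-up stand-ins so the snippet runs without downloading the model; with the real model you would pass `logits` and `model.config.id2label`.

```python
import torch


def top_k_predictions(logits: torch.Tensor, id2label: dict, k: int = 5):
    """Return the k most probable (label, probability) pairs for the first image."""
    probs = logits.softmax(dim=-1)  # convert logits to probabilities
    top = probs.topk(k, dim=-1)     # sorted from most to least probable
    return [
        (id2label[idx], prob)
        for prob, idx in zip(top.values[0].tolist(), top.indices[0].tolist())
    ]


# Made-up stand-ins for model output and label mapping (assumptions).
dummy_logits = torch.tensor([[0.1, 2.0, -1.0, 0.5]])
dummy_labels = {0: "cat", 1: "dog", 2: "car", 3: "tree"}

result = top_k_predictions(dummy_logits, dummy_labels, k=2)
for label, prob in result:
    print(f"{label}: {prob:.3f}")
```

Reporting several candidates with probabilities is often more informative than the single argmax label when an image is ambiguous.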

BibTeX entry and citation info

@article{DBLP:journals/corr/abs-2201-03545,
  author    = {Zhuang Liu and
               Hanzi Mao and
               Chao{-}Yuan Wu and
               Christoph Feichtenhofer and
               Trevor Darrell and
               Saining Xie},
  title     = {A ConvNet for the 2020s},
  journal   = {CoRR},
  volume    = {abs/2201.03545},
  year      = {2022},
  url       = {https://arxiv.org/abs/2201.03545},
  eprinttype = {arXiv},
  eprint    = {2201.03545},
  timestamp = {Thu, 20 Jan 2022 14:21:35 +0100},
  biburl    = {https://dblp.org/rec/journals/corr/abs-2201-03545.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

📄 License

This project is licensed under the Apache-2.0 license.

📄 Additional Information

Property	Details
Model Type	ConvNeXT (tiny-sized model)
Training Data	ImageNet-1k
Tags	vision, image-classification

⚠️ Important Note

The team releasing ConvNeXT did not write a model card for this model, so this model card has been written by the Hugging Face team.
