ResNet-50: Open-Source Image Classification Model Pre-trained on ImageNet

ResNet-50

Developed by Microsoft

ResNet-50 is a residual network pre-trained on ImageNet-1k. It uses the v1.5 architecture variant and is suited to image classification tasks.

Task: Image Classification | Open Source License: Apache-2.0 | Tags: ResNet optimized version, ImageNet classification, 224x224 resolution

Downloads 273.80k

Release Date: March 16, 2022

Model Overview

ResNet-50 is a convolutional neural network that uses residual learning and skip connections to make deep models trainable. The v1.5 version improves top-1 accuracy by approximately 0.5% over v1 by moving the stride-2 downsampling in bottleneck blocks from the first 1x1 convolution to the 3x3 convolution.

Model Features

Residual connection design

Uses skip connections to mitigate the vanishing gradient problem, enabling the training of much deeper networks

v1.5 architecture optimization

Moving the stride-2 downsampling from the first 1x1 convolution to the 3x3 convolution improves top-1 accuracy by approximately 0.5% over the original v1, at a throughput cost of roughly 5% (imgs/sec) according to Nvidia

ImageNet pre-training

Pre-trained on the ImageNet-1k dataset, ready for direct use in 1000-class image classification

Model Capabilities

Image classification

Feature extraction

Use Cases

Computer vision

General image classification

Classifies input images into 1000 ImageNet categories

Achieves high accuracy on ImageNet-1k

Transfer learning base model

Can be fine-tuned as a pre-trained model for domain-specific image classification tasks

🚀 ResNet-50 v1.5

A ResNet model pre-trained on ImageNet-1k at a resolution of 224x224, introduced in the paper Deep Residual Learning for Image Recognition by He et al.

Disclaimer: The team releasing ResNet did not write a model card for this model, so this model card has been written by the Hugging Face team.

✨ Features

Vision and Image Classification: This model is designed for vision tasks, specifically image classification.
Pre-trained on ImageNet-1k: It has been pre-trained on the ImageNet-1k dataset at a resolution of 224x224.
Residual Learning and Skip Connections: ResNet popularized the concepts of residual learning and skip connections, enabling the training of much deeper models.
ResNet v1.5: This version differs from the original model, offering slightly higher accuracy (~0.5% top-1) at the cost of slightly lower throughput (~5% fewer imgs/sec) according to Nvidia.

Property	Details
Model Type	Convolutional Neural Network (ResNet v1.5)
Training Data	ImageNet-1k

📚 Documentation

Model description

ResNet (Residual Network) is a convolutional neural network that popularized the concepts of residual learning and skip connections, allowing for the training of much deeper models.
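To give some intuition for why skip connections help deeper models train, here is a toy sketch, not the ResNet implementation itself. It assumes every layer is a scalar linear map with a hypothetical weight `w`, so the end-to-end gradient reduces to a simple product: `w**depth` for a plain stack, versus `(1 + w)**depth` when each layer adds its input back.

```python
def plain_gradient(w: float, depth: int) -> float:
    """Gradient of output w.r.t. input for a stack of layers x -> w * x."""
    grad = 1.0
    for _ in range(depth):
        grad *= w  # each plain layer scales the gradient by w
    return grad


def residual_gradient(w: float, depth: int) -> float:
    """Same stack, but each layer computes x -> x + w * x (a skip connection)."""
    grad = 1.0
    for _ in range(depth):
        grad *= 1.0 + w  # the identity path contributes the constant 1
    return grad


# Hypothetical values chosen only for illustration.
w, depth = 0.01, 50
print(f"plain:    {plain_gradient(w, depth):.3e}")
print(f"residual: {residual_gradient(w, depth):.3e}")
```

With these toy values the plain product collapses toward zero while the residual product stays close to 1. Real networks have nonlinearities and learned weights, so this conveys only the intuition, not a guarantee.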

This is ResNet v1.5, which differs from the original model. In the bottleneck blocks that require downsampling, v1 applies a stride of 2 in the first 1x1 convolution, while v1.5 applies it in the 3x3 convolution. This difference makes ResNet-50 v1.5 slightly more accurate (~0.5% top-1) than v1, at the cost of slightly lower throughput (~5% fewer imgs/sec) according to Nvidia.
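The stride placement can be made concrete with a little arithmetic. The sketch below is a rough estimate rather than a benchmark: it counts multiply-accumulates (MACs) for the three main-path convolutions of one downsampling bottleneck block under each variant. The helper names are hypothetical, and the example shapes (a 56x56 input with 256 -> 128 -> 512 channels) are assumed here to represent the first downsampling block of ResNet-50.

```python
def conv_out_size(size: int, kernel: int, stride: int, padding: int) -> int:
    """Spatial output size of a convolution (standard floor formula)."""
    return (size + 2 * padding - kernel) // stride + 1


def bottleneck_macs(size: int, in_ch: int, mid_ch: int, out_ch: int, variant: str):
    """Return (output_size, MACs) for a downsampling bottleneck block.

    Only the three main-path convolutions are counted. The shortcut
    projection, batch norm, and activations are ignored for simplicity.
    """
    # v1 strides in the first 1x1 conv; v1.5 strides in the 3x3 conv.
    stride_1x1, stride_3x3 = (2, 1) if variant == "v1" else (1, 2)

    s1 = conv_out_size(size, kernel=1, stride=stride_1x1, padding=0)
    macs = s1 * s1 * in_ch * mid_ch          # first 1x1 conv

    s2 = conv_out_size(s1, kernel=3, stride=stride_3x3, padding=1)
    macs += s2 * s2 * 9 * mid_ch * mid_ch    # 3x3 conv

    s3 = conv_out_size(s2, kernel=1, stride=1, padding=0)
    macs += s3 * s3 * mid_ch * out_ch        # last 1x1 conv
    return s3, macs


for variant in ("v1", "v1.5"):
    out_size, macs = bottleneck_macs(56, 256, 128, 512, variant)
    print(f"{variant}: output {out_size}x{out_size}, {macs:,} MACs")
```

Both variants produce the same output resolution, but in v1.5 the first 1x1 convolution runs at full resolution and therefore costs more compute, which points in the same direction as the reported throughput drawback. A commonly given reason for the accuracy gain is that a stride-2 1x1 convolution never reads three quarters of its input positions, whereas a stride-2 3x3 convolution still covers all of them.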


Intended uses & limitations

You can use the raw model for image classification. Check the model hub to find fine-tuned versions for tasks that interest you.
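When no fine-tuned checkpoint fits a task, one common starting point is to reuse the pre-trained backbone and train a new classification head. The sketch below is one possible setup, not an official recipe: `freeze_backbone` and `load_for_finetuning` are hypothetical helper names, and it assumes the `resnet` backbone attribute exposed by `ResNetForImageClassification` in the `transformers` library.

```python
from transformers import ResNetForImageClassification


def freeze_backbone(model: ResNetForImageClassification) -> int:
    """Freeze the convolutional backbone; return the count of trainable parameters left."""
    for param in model.resnet.parameters():
        param.requires_grad = False
    return sum(p.numel() for p in model.parameters() if p.requires_grad)


def load_for_finetuning(num_labels: int, checkpoint: str = "microsoft/resnet-50"):
    """Load pre-trained weights and attach a freshly initialized head with `num_labels` outputs."""
    model = ResNetForImageClassification.from_pretrained(
        checkpoint,
        num_labels=num_labels,
        ignore_mismatched_sizes=True,  # the original 1000-class head is discarded
    )
    freeze_backbone(model)
    return model


# Example (downloads weights from the Hub; 5 is a hypothetical class count):
# model = load_for_finetuning(num_labels=5)
```

Freezing the backbone trains only the new head, which is cheap and often a reasonable first baseline; unfreezing some or all backbone layers later is a separate design choice that depends on how much labeled data is available.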

💻 Usage Examples

Basic Usage

# Here is how to use this model to classify an image of the COCO 2017 dataset into one of the 1,000 ImageNet classes:
from transformers import AutoImageProcessor, ResNetForImageClassification
import torch
from datasets import load_dataset

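# Load a sample image hosted on the Hugging Face Hub
dataset = load_dataset("huggingface/cats-image")
image = dataset["test"]["image"][0]

# Load the preprocessing pipeline and the pre-trained classifier
processor = AutoImageProcessor.from_pretrained("microsoft/resnet-50")
model = ResNetForImageClassification.from_pretrained("microsoft/resnet-50")

# Preprocess the image and return PyTorch tensors
inputs = processor(image, return_tensors="pt")

# Run inference without tracking gradients
with torch.no_grad():
    logits = model(**inputs).logits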

# model predicts one of the 1000 ImageNet classes
predicted_label = logits.argmax(-1).item()
print(model.config.id2label[predicted_label])
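
The argmax above reports only the single most likely class. As a minimal sketch of how raw logits could be turned into ranked probabilities, the helper below applies a softmax and returns the top `k` labels. `top_k_probs` is a hypothetical helper, and the logits and labels are toy values, not real model output.

```python
import math


def top_k_probs(logits, id2label, k=5):
    """Return the k most likely (label, probability) pairs from raw logits."""
    # Subtract the max logit for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Rank class indices from most to least probable.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(id2label[i], probs[i]) for i in ranked[:k]]


# Hypothetical logits and labels for illustration only.
toy_logits = [2.0, 0.5, -1.0, 3.0]
toy_labels = {0: "tabby", 1: "tiger cat", 2: "remote control", 3: "Egyptian cat"}
for label, p in top_k_probs(toy_logits, toy_labels, k=2):
    print(f"{label}: {p:.3f}")
```

With the model loaded as in the example above, the same helper could be called as `top_k_probs(logits[0].tolist(), model.config.id2label)`.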

For more code examples, refer to the documentation.

BibTeX entry and citation info

@inproceedings{he2016deep,
  title={Deep residual learning for image recognition},
  author={He, Kaiming and Zhang, Xiangyu and Ren, Shaoqing and Sun, Jian},
  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
  pages={770--778},
  year={2016}
}

📄 License

This model is licensed under the Apache-2.0 license.
