Y

Yolos Small 300

Developed by hustvl
A small-sized YOLOS model fine-tuned on the COCO 2017 object detection dataset, utilizing Vision Transformer architecture for efficient object detection
Downloads 86
Release Time : 4/26/2022

Model Overview

YOLOS is a Vision Transformer (ViT)-based object detection model trained with DETR loss function, featuring a simple yet high-performance structure

Model Features

Transformer Architecture
Adopts Vision Transformer architecture, breaking through the limitations of traditional CNNs in object detection
Efficient Training
Uses bipartite matching loss function, optimizing query-to-annotation mapping via Hungarian algorithm
Simple Structure
The model has a simple structure but excellent performance, achieving 42 AP detection accuracy even in its base size

Model Capabilities

Image Object Detection
Multi-class Object Recognition
Bounding Box Prediction

Use Cases

Visual Analysis
Scene Monitoring
Real-time detection and localization of multiple object classes in surveillance videos
Smart Retail
Identifying shelf products and analyzing customer behavior
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase