Y

Yolos Small Finetuned Masks

Developed by nickmuchi
A small-scale Vision Transformer model based on YOLOS architecture, fine-tuned specifically for mask detection tasks, trained on COCO and mask detection datasets
Downloads 153
Release Time : 6/17/2022

Model Overview

This model is an object detection model based on Vision Transformer (ViT), pre-trained on COCO dataset and specifically fine-tuned for mask detection tasks, capable of identifying three states: 'wearing mask', 'not wearing mask', and 'improperly wearing mask'

Model Features

Efficient Vision Transformer Architecture
Adopts a simple architecture based on ViT, trained with DETR loss function, achieving good detection accuracy while maintaining structural simplicity
Specialized Mask Detection Optimization
Fine-tuned for 200 epochs on a mask dataset with 853 annotated images, optimizing mask-related detection capabilities
Multi-scenario Adaptation
Evaluation results show good detection performance across various object scales (small/medium/large)

Model Capabilities

Image Object Detection
Mask Wearing Status Recognition
Crowd Scene Analysis

Use Cases

Public Health Monitoring
Public Place Mask Wearing Monitoring
Real-time monitoring of mask wearing status in public places like malls and stations
Achieves 53.2% average precision (AP@0.5)
Smart Security
Access Control System
Integrated into access control systems to automatically detect mask wearing status
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase