Beit FaceMask Finetuned
B
Beit FaceMask Finetuned
Developed by AkshatSurolia
A Vision Transformer model based on the BEiT architecture, specifically designed for mask detection tasks, fine-tuned on the Face-Mask18K dataset.
Downloads 23
Release Time : 3/2/2022
Model Overview
This model adopts the BEiT architecture, pre-trained on ImageNet-21k through self-supervised learning, and fine-tuned on the Face-Mask18K dataset containing 18,000 images to detect whether a mask is worn in an image.
Model Features
Self-supervised Pre-training
Utilizes BEiT's self-supervised pre-training method to learn general image representations, improving downstream task performance.
Relative Position Encoding
Employs relative position encoding similar to the T5 model, replacing the absolute position encoding in traditional ViT, enhancing model flexibility.
Efficient Fine-tuning
Fine-tuned on the Face-Mask18K dataset, achieving high accuracy with only a small amount of labeled data.
Model Capabilities
Image Classification
Mask Detection
Visual Feature Extraction
Use Cases
Public Health
Public Space Mask-Wearing Detection
Used to monitor whether people in public spaces are wearing masks, assisting in epidemic prevention management.
Evaluation accuracy reaches 97.5%
Smart Security
Access Control System Identity Verification
Combined with facial recognition, detects whether a person is wearing a mask during identity verification.
Featured Recommended AI Models
Š 2025AIbase