V

Vit FaceMask Finetuned

Developed by AkshatSurolia
A Vision Transformer model trained on the Face-Mask18K dataset for mask detection tasks, achieving an accuracy of 98.94%.
Downloads 26
Release Time : 3/2/2022

Model Overview

This model adopts the Vision Transformer architecture, pre-trained and fine-tuned on a dataset of 18,000 images, specifically designed to detect whether masks are worn in images.

Model Features

High Accuracy
Achieves an outstanding accuracy of 98.94% on the test set.
Efficient Training
Fast training speed, with a sample training speed of 23.943 per second.
Transformer-based Architecture
Utilizes the advanced Vision Transformer architecture, effectively capturing global image features.

Model Capabilities

Image Classification
Mask Detection

Use Cases

Public Health
Public Place Mask Wearing Detection
Used to detect whether people in public places are wearing masks
Detection accuracy reaches 98.94%
Security Monitoring
Surveillance Video Mask Wearing Analysis
Analyzes mask-wearing situations in surveillance videos
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase