V

Vit Base Blur

Developed by WT-MM
This model is a fine-tuned version of google/vit-base-patch16-224-in21k on a blurry image dataset, designed to distinguish between blurry and clear images.
Downloads 92
Release Time : 7/5/2023

Model Overview

This model is used for binary classification tasks, differentiating between 'noisy' (blurry) images and clear images, where 'noisy' images are the result of incomplete or insufficient iterations during image generation by LDM (Latent Diffusion Model).

Model Features

High Accuracy
Achieved an accuracy of 1.0 on the evaluation set, effectively distinguishing between blurry and clear images.
Based on ViT Architecture
Utilizes the Vision Transformer (ViT) architecture, offering robust image processing capabilities.
Few-Shot Fine-Tuning
Achieves excellent performance with fine-tuning on only about 1,000 images.

Model Capabilities

Image Classification
Blurry Image Detection
Binary Classification Task

Use Cases

Image Quality Detection
Generated Image Quality Assessment
Used to evaluate whether images generated by LDM reach sufficient clarity.
Accurately distinguishes between images generated with 30 steps and 10 steps.
Image Preprocessing
Automatic Blurry Image Filtering
Automatically identifies and filters low-quality images in image processing pipelines.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase