V

Vit Receipts Classifier

Developed by jjmcarrascosa
A binary classification model based on ViT architecture for identifying whether an image is a receipt/invoice
Downloads 75
Release Time : 8/26/2022

Model Overview

This model is a fine-tuned version based on the ViT architecture, specifically designed for binary classification tasks of receipt vs. non-receipt images. It demonstrates outstanding performance on the evaluation set with an F1 score of 0.9991.

Model Features

High-Precision Classification
Achieves an F1 score of 0.9991 on the evaluation set, accurately distinguishing between receipt and non-receipt images.
Multi-Source Data Training
Trained using multiple datasets including CORD, RVL-CDIP, and Visual-Genome to enhance model generalization.
Adaptability to Various Image Formats
Capable of processing scanned documents, photographs, or color/grayscale images captured by mobile devices.

Model Capabilities

Image Classification
Receipt Recognition
Invoice Detection

Use Cases

Document Processing
Automatic Receipt Classification
Automatically identifies and classifies uploaded receipt images in corporate financial systems.
Accuracy as high as 99.9%
Receipt Management System
Combined with OCR technology to build an end-to-end receipt information extraction system.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase