V

Visualheist Large

Developed by shixuanleong
VisualHeist is an object detection model specifically designed to extract charts, schematics, and tables from PDF files, including titles, headers, and footers.
Downloads 1,693
Release Time : 10/28/2024

Model Overview

VisualHeist can accurately identify and segment charts and tables in PDF documents by fine-tuning the object detection model, improving the automation level and work efficiency of document processing.

Model Features

Multiple version options
Two model scales, basic and large versions, are provided to meet the requirements of different hardware configurations.
High-quality training data
Fine-tuning is performed using 3435 charts and 1716 tables, and all data is manually annotated.
Wide applicability
It performs well on literature in various disciplinary fields, including chemistry, materials science, biology, etc.

Model Capabilities

PDF document parsing
Chart detection
Table detection
Schematic detection
Academic literature processing

Use Cases

Academic research
Literature data extraction
Automatically extract chart and table data from scientific research papers
The F1 score reaches 93% (overall)
Document processing
PDF content structuring
Automatically classify and extract visual elements in PDF documents
The F1 score reaches 92% on supplementary materials
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase