T

TFT ID 1.0

Developed by yifeihu
TFT-ID is a fine-tuned object detection model specifically designed for detecting tables, figures, and text regions in academic papers, based on Florence-2 fine-tuning
Downloads 153
Release Time : 7/25/2024

Model Overview

This model can identify tables, figures, and text regions in academic paper pages, outputting bounding box information. Text regions can be directly fed into OCR processes

Model Features

High-Precision Detection
Achieves a 98.84% success rate in table/figure recognition tasks
Multi-Region Recognition
Simultaneously detects tables, figures, and text regions
Manually Annotated Data
Training data includes over 36,000 manually annotated and verified bounding boxes
OCR Integration
Text regions can be directly fed into OCR processes, with the TB-OCR-preview-0.1 model recommended

Model Capabilities

Academic paper image analysis
Table detection
Figure detection
Text region detection
Bounding box output

Use Cases

Academic Research
Paper Content Analysis
Automatically identifies tables, figures, and text regions in papers
Helps researchers quickly locate and extract key information from papers
Literature Digitization
Converts paper or PDF documents into structured digital content
Improves literature processing efficiency for subsequent analysis and retrieval
Publishing Industry
Journal Layout Verification
Automatically checks if the positions of figures and tables in papers meet publishing requirements
Reduces manual inspection workload and improves publishing efficiency
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase