P

Pix2struct Base Table2html

Developed by KennethTM
A Pix2Struct-based model for converting table images to structured HTML code
Downloads 104
Release Time : 9/10/2024

Model Overview

This model takes table images as input and outputs corresponding HTML code, enabling OCR and structured recognition of table images. Suitable for scenarios requiring table data extraction from images.

Model Features

Table Image Recognition
Accurately recognizes text and structure in table images
HTML Generation
Converts recognition results into structured HTML code
Multi-dataset Training
Trained on both MMTab and PubTabNet datasets for improved generalization
1024 Chunk Length
Supports up to 1024 chunk length, suitable for complex tables

Model Capabilities

Table Image Recognition
HTML Code Generation
Table Structure Parsing
Multilingual Table Processing

Use Cases

Document Digitization
PDF Table Extraction
Extract tables from PDF documents and convert them to HTML format
Generates editable HTML table code
Data Collection
Web Table Scraping
Convert tables from webpage screenshots into structured data
Obtain directly usable table data
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase