Lilt-infoxlm-base Open-source Model - A Practical Tool for Structured Document Understanding Tasks

Lilt Infoxlm Base

Developed by SCUT-DLVCLab

LiLT-InfoXLM is a language-agnostic layout transformer model, created by combining the pre-trained InfoXLM with a language-independent layout transformer (LiLT), suitable for structured document understanding tasks.

Multimodal Fusion

Transformers

Open Source License:MIT #Language-agnostic layout #Document understanding #Multimodal fusion

Downloads 110

Release Time : 10/10/2022

Model Overview

This model combines a pre-trained text encoder with a lightweight layout transformer, applicable to tasks such as document image classification, document parsing, and document question answering.

Model Features

Language-agnostic

Can be combined with any pre-trained RoBERTa encoder, supporting multiple languages.

Lightweight layout transformation

Processes document layout information through a lightweight layout transformer, enhancing structured document understanding.

Flexible adaptation

Can flexibly adapt to text encoders of different languages, with strong scalability.

Model Capabilities

Document image classification

Document parsing

Document question answering

Use Cases

Document processing

Document classification

Classify scanned or digital documents, such as invoices, contracts, etc.

Document parsing

Extract key information from structured documents, such as tables, fields, etc.

Document question answering

Answer user questions based on document content.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Lilt Infoxlm Base

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 LiLT-InfoXLM (base-sized model)

🚀 Quick Start

✨ Features

📚 Documentation

Intended uses & limitations

How to use

BibTeX entry and citation info

📄 License