L

Lilt Infoxlm Base

Developed by SCUT-DLVCLab
LiLT-InfoXLM is a language-agnostic layout transformer model, created by combining the pre-trained InfoXLM with a language-independent layout transformer (LiLT), suitable for structured document understanding tasks.
Downloads 110
Release Time : 10/10/2022

Model Overview

This model combines a pre-trained text encoder with a lightweight layout transformer, applicable to tasks such as document image classification, document parsing, and document question answering.

Model Features

Language-agnostic
Can be combined with any pre-trained RoBERTa encoder, supporting multiple languages.
Lightweight layout transformation
Processes document layout information through a lightweight layout transformer, enhancing structured document understanding.
Flexible adaptation
Can flexibly adapt to text encoders of different languages, with strong scalability.

Model Capabilities

Document image classification
Document parsing
Document question answering

Use Cases

Document processing
Document classification
Classify scanned or digital documents, such as invoices, contracts, etc.
Document parsing
Extract key information from structured documents, such as tables, fields, etc.
Document question answering
Answer user questions based on document content.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase