L

Layoutlmv3 Finetuned DocLayNet

Developed by Mit1208
A document layout analysis model fine-tuned based on the LayoutLMv3 architecture, specifically designed for document element classification tasks in the DocLayNet dataset.
Downloads 226
Release Time : 3/24/2023

Model Overview

This model is a fine-tuned version based on microsoft/layoutlmv3-base, primarily used for token classification tasks in document images, capable of identifying and classifying different layout elements in documents.

Model Features

Document Layout Understanding
Capable of understanding the visual layout and textual content of documents, identifying different element regions within documents.
Multimodal Processing
Simultaneously processes textual content and visual layout information for more accurate document analysis.
Efficient Fine-tuning
Fine-tuned based on the pre-trained LayoutLMv3 model, delivering excellent performance on specific tasks.

Model Capabilities

Document Layout Analysis
Visual Text Classification
Document Element Recognition

Use Cases

Document Processing
Contract Analysis
Automatically identifies elements such as headings, paragraphs, and signature areas in contract documents.
F1 score reaches 0.6667
Academic Paper Parsing
Extracts sections such as abstracts, main text, figures, and references from academic papers.
Digital Office
Table Recognition
Identifies table regions and content from scanned documents.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase