M

Markuplm Base Finetuned Websrc

Developed by microsoft
MarkupLM is a multimodal pretrained model designed for rich visual document understanding and information extraction tasks, combining text and markup language information.
Downloads 168
Release Time : 6/14/2022

Model Overview

This model is specifically designed for tasks such as web Q&A and web information extraction, achieving more accurate document understanding by integrating text content and HTML markup structure.

Model Features

Multimodal Understanding
Processes both text content and HTML markup structure simultaneously for more comprehensive document understanding.
Web-Optimized
Specially optimized for web content, delivering excellent performance on datasets like WebSRC.
Concise and Efficient Design
The model features a simple yet highly effective design, achieving SOTA performance across multiple benchmarks.

Model Capabilities

Web Content Understanding
Structured Information Extraction
Web Q&A
Document Intelligence Processing

Use Cases

Web Information Processing
Web Q&A System
Answers user questions based on web content
Achieves outstanding performance on the WebSRC dataset
Web Data Extraction
Extracts structured data from web pages
Document Intelligence
Rich Text Document Analysis
Parses documents with rich formatting
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase