LinkBERT-base
LinkBERT-base is a pre-trained model that leverages English Wikipedia articles and their hyperlink information. By capturing document links, it offers improved performance across a range of NLP tasks.
Quick Start
LinkBERT-base is a model pre-trained on English Wikipedia articles with hyperlink information. It was introduced in the paper LinkBERT: Pretraining Language Models with Document Links (ACL 2022). The code and data are available in this repository.
Features
- LinkBERT is a transformer encoder (BERT-like) model pre-trained on a large document corpus. It improves upon BERT by capturing document links such as hyperlinks and citation links, incorporating knowledge that spans multiple documents.
- It can serve as a drop-in replacement for BERT. It performs better on general language understanding tasks (e.g., text classification), and is especially effective for knowledge-intensive tasks (e.g., question answering) and cross-document tasks (e.g., reading comprehension, document retrieval); see the fill-mask example under Usage Examples below.
Installation
No separate installation steps are required beyond the Hugging Face transformers library (for example, pip install transformers); the model is loaded directly by name, as shown below.
Usage Examples
Basic Usage
To use the model to get the features of a given text in PyTorch:
```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained('michiyasunaga/LinkBERT-base')
model = AutoModel.from_pretrained('michiyasunaga/LinkBERT-base')

# Encode an example sentence and extract token-level embeddings
# of shape (batch_size, sequence_length, hidden_size).
inputs = tokenizer("Hello, my dog is cute", return_tensors="pt")
outputs = model(**inputs)
last_hidden_states = outputs.last_hidden_state
```
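Because LinkBERT keeps BERT's masked-language-modeling interface, you can also sanity-check the raw model with the transformers fill-mask pipeline. This is a minimal sketch that assumes the published checkpoint includes the MLM head; the example sentence is purely illustrative.

```python
from transformers import pipeline

# Query the masked-language-modeling head through the fill-mask pipeline.
unmasker = pipeline("fill-mask", model="michiyasunaga/LinkBERT-base")

# [MASK] is the standard BERT mask token used by this tokenizer.
for prediction in unmasker("Paris is the [MASK] of France."):
    print(prediction["token_str"], round(prediction["score"], 3))
```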
Advanced Usage
For fine-tuning, you can use this repository or follow other BERT fine-tuning codebases.
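As an illustration only (not the authors' training code), the sketch below fine-tunes LinkBERT-base for sequence classification with the Hugging Face Trainer; the choice of GLUE SST-2 as the downstream dataset and the hyperparameters are assumptions made for this example.

```python
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

model_name = "michiyasunaga/LinkBERT-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
# A fresh classification head is initialized on top of the pre-trained encoder.
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

# SST-2 is used here only as a stand-in downstream task.
dataset = load_dataset("glue", "sst2")

def tokenize(batch):
    return tokenizer(batch["sentence"], truncation=True, max_length=128)

tokenized = dataset.map(tokenize, batched=True)

# Illustrative hyperparameters; tune them for your task.
args = TrainingArguments(
    output_dir="linkbert-sst2",
    learning_rate=2e-5,
    per_device_train_batch_size=32,
    num_train_epochs=3,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    tokenizer=tokenizer,  # enables dynamic padding via DataCollatorWithPadding
)
trainer.train()
```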
Documentation
Intended uses & limitations
The model can be used by fine-tuning it on a downstream task, such as question answering, sequence classification, or token classification. You can also use the raw model for feature extraction (i.e., obtaining embeddings for input text).
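For feature extraction, one common recipe (not prescribed by this model card, so treat it as an assumption) is to mean-pool the token embeddings into a single vector per input:

```python
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("michiyasunaga/LinkBERT-base")
model = AutoModel.from_pretrained("michiyasunaga/LinkBERT-base")

sentences = ["LinkBERT captures links between documents.",
             "Hyperlinks connect related Wikipedia articles."]
inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    token_embeddings = model(**inputs).last_hidden_state  # (batch, seq_len, hidden)

# Mean-pool over real tokens only (padding positions are masked out).
mask = inputs["attention_mask"].unsqueeze(-1).float()
sentence_embeddings = (token_embeddings * mask).sum(dim=1) / mask.sum(dim=1)
print(sentence_embeddings.shape)  # torch.Size([2, 768]) for the base model
```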
Evaluation results
When fine-tuned on downstream tasks, LinkBERT achieves the following results.
General benchmarks (MRQA and GLUE):
|  | HotpotQA (F1) | TriviaQA (F1) | SearchQA (F1) | NaturalQ (F1) | NewsQA (F1) | SQuAD (F1) | GLUE (avg. score) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| BERT-base | 76.0 | 70.3 | 74.2 | 76.5 | 65.7 | 88.7 | 79.2 |
| LinkBERT-base | 78.2 | 73.9 | 76.8 | 78.3 | 69.3 | 90.1 | 79.6 |
| BERT-large | 78.1 | 73.7 | 78.3 | 79.0 | 70.9 | 91.1 | 80.7 |
| LinkBERT-large | 80.8 | 78.2 | 80.5 | 81.0 | 72.6 | 92.7 | 81.1 |
Technical Details
LinkBERT is a transformer encoder (BERT-like) model. It improves on BERT by capturing document links such as hyperlinks and citation links, so that the model incorporates knowledge that spans multiple documents. Specifically, it was pretrained by feeding linked documents into the same language model context, in addition to single documents.
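As a conceptual illustration only (not the actual pretraining pipeline), the snippet below shows the kind of input this produces: an anchor passage and a passage from a linked document packed into one BERT-style context as segment A and segment B. Both passages are made up for the example.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("michiyasunaga/LinkBERT-base")

# Hypothetical anchor passage and a passage from a hyperlinked document.
anchor = "Tidal energy is produced by the rise and fall of ocean tides."
linked = "A tidal barrage captures energy from water moving in and out of a bay."

# Pack the two passages into a single context:
# [CLS] anchor tokens [SEP] linked tokens [SEP]
encoded = tokenizer(anchor, linked, return_tensors="pt")
print(tokenizer.decode(encoded["input_ids"][0]))
print(encoded["token_type_ids"][0])  # 0s for segment A, 1s for segment B
```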
License
The model is licensed under the Apache 2.0 license.
Additional Information
Tags
- bert
- exbert
- linkbert
- feature-extraction
- fill-mask
- question-answering
- text-classification
- token-classification
Datasets
- English Wikipedia (with hyperlink information)
Citation
If you find LinkBERT useful in your project, please cite the following:
@InProceedings{yasunaga2022linkbert,
author = {Michihiro Yasunaga and Jure Leskovec and Percy Liang},
title = {LinkBERT: Pretraining Language Models with Document Links},
year = {2022},
booktitle = {Association for Computational Linguistics (ACL)},
}