LinkBERT: Pretraining Language Models with Document Links

Stanford University

Indexed incrossref

Abstract

Language model (LM) pretraining captures various knowledge from text corpora, helping downstream NLP tasks. However, existing methods such as BERT model a single document, failing to capture document dependencies and knowledge that spans across documents. In this work, we propose LinkBERT, an effective LM pretraining method that incorporates document links, such as hyperlinks. Given a pretraining corpus, we view it as a graph of documents, and create LM inputs by placing linked documents in the same context. We then train the LM with two joint self-supervised tasks: masked language modeling and our newly proposed task, document relation prediction. We study LinkBERT in two domains: general domain (pretrained…

Citation impact

291
total citations
FWCI
28.25
Percentile
100%
References
78
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Hyperlink
  • Language model
  • Natural language processing
  • Artificial intelligence
  • Information retrieval
  • Context (archaeology)
  • Graph
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding