articleAug 15, 2005GREEN OA

A Markov random field model for term dependencies

University of Massachusetts Amherst

Indexed incrossref

Abstract

This paper develops a general, formal framework for modeling term dependencies via Markov random fields. The model allows for arbitrary text features to be incorporated as evidence. In particular, we make use of features based on occurrences of single terms, ordered phrases, and unordered phrases. We explore full independence, sequential dependence, and full dependence variants of the model. A novel approach is developed to train the model that directly maximizes the mean average precision rather than maximizing the likelihood of the training data. Ad hoc retrieval experiments are presented on several newswire and web collections, including the GOV2 collection used at the TREC 2004 Terabyte Track. The results…

Citation impact

848
total citations
FWCI
74.64
Percentile
100%
References
31
Citations per year

Authors

2

Topics & keywords

Keywords
  • Terabyte
  • Computer science
  • Term (time)
  • Markov random field
  • Independence (probability theory)
  • Markov chain
  • Topic model
  • Field (mathematics)
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding