A Markov random field model for term dependencies
University of Massachusetts Amherst
Abstract
This paper develops a general, formal framework for modeling term dependencies via Markov random fields. The model allows for arbitrary text features to be incorporated as evidence. In particular, we make use of features based on occurrences of single terms, ordered phrases, and unordered phrases. We explore full independence, sequential dependence, and full dependence variants of the model. A novel approach is developed to train the model that directly maximizes the mean average precision rather than maximizing the likelihood of the training data. Ad hoc retrieval experiments are presented on several newswire and web collections, including the GOV2 collection used at the TREC 2004 Terabyte Track. The results…
Citation impact
- FWCI
- 74.64
- Percentile
- 100%
- References
- 31
Authors
2Topics & keywords
- Terabyte
- Computer science
- Term (time)
- Markov random field
- Independence (probability theory)
- Markov chain
- Topic model
- Field (mathematics)
- Peace, Justice and strong institutions