Beyond Independent Relevance
University of Illinois Urbana-Champaign · Carnegie Mellon University
Abstract
We present a non-traditional retrieval problem that we call subtopic retrieval: finding documents that cover many different subtopics of a query topic. In this setting, the utility of a document in a ranking depends on the other documents in the ranking, which violates the independent-relevance assumption made by most traditional retrieval methods. Subtopic retrieval poses challenges both for evaluating performance and for developing effective algorithms. We propose a framework for evaluating subtopic retrieval which generalizes the traditional precision and recall metrics by accounting for intrinsic topic difficulty as well as redundancy in…
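To make the dependence between documents concrete, here is a minimal sketch of a coverage-style measure: the fraction of a topic's subtopics covered by the top k documents of a ranking. The abstract above is truncated and does not spell out the exact metrics, so the function name `subtopic_coverage_at_k`, its parameters, and the toy data are illustrative assumptions rather than the paper's definitions.

```python
from typing import Hashable, Sequence, Set


def subtopic_coverage_at_k(
    ranking: Sequence[Set[Hashable]],
    total_subtopics: int,
    k: int,
) -> float:
    """Fraction of subtopics covered by the first k documents.

    `ranking` holds, for each ranked document, the set of subtopic
    labels it covers (hypothetical representation for illustration).
    """
    if total_subtopics <= 0:
        raise ValueError("total_subtopics must be positive")
    covered: Set[Hashable] = set()
    for doc_subtopics in ranking[:k]:
        # A document only helps through subtopics not already covered,
        # so its contribution depends on the documents ranked above it.
        covered |= doc_subtopics
    return len(covered) / total_subtopics


# Toy topic with three subtopics: "a", "b", "c".
ranking = [{"a", "b"}, {"a", "b"}, {"c"}]
coverage_by_depth = [subtopic_coverage_at_k(ranking, 3, k) for k in (1, 2, 3)]
```

In this toy ranking the second document is relevant in isolation but repeats the first document's subtopics, so coverage does not increase at depth 2; it rises only when the third document contributes a new subtopic.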
Citation impact
- FWCI: 97.26
- Percentile: 100%
- References: 27
Authors
3
Topics & keywords
- Computer science
- Ranking (information retrieval)
- Relevance (law)
- Information retrieval
- Precision and recall
- Redundancy (engineering)
- Baseline (sea)
- Data mining
- Quality Education