articleJul 25, 2004Closed access

Retrieval evaluation with incomplete information

National Institute of Standards and Technology

Indexed incrossref

Abstract

This paper examines whether the Cranfield evaluation methodology is robust to gross violations of the completeness assumption (i.e., the assumption that all relevant documents within a test collection have been identified and are present in the collection). We show that current evaluation measures are not robust to substantially incomplete relevance judgments. A new measure is introduced that is both highly correlated with existing measures when complete judgments are available and more robust to incomplete judgment sets. This finding suggests that substantially larger or dynamic test collections built using current pooling practices should be viable laboratory tools, despite the fact that the relevance…

Citation impact

738
total citations
FWCI
95.33
Percentile
100%
References
21
Citations per year

Authors

2

Topics & keywords

Keywords
  • Pooling
  • Computer science
  • Relevance (law)
  • Completeness (order theory)
  • Complete information
  • Imperfect
  • Measure (data warehouse)
  • Information retrieval
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.