articleACM Transactions on Information SystemsJun 1, 2008Closed access

Interpreting TF-IDF term weights as making relevance decisions

Hong Kong Polytechnic University · Chinese University of Hong Kong · +1 more institution

Indexed incrossref

Abstract

A novel probabilistic retrieval model is presented. It forms a basis to interpret the TF-IDF term weights as making relevance decisions. It simulates the local relevance decision-making for every location of a document, and combines all of these “local” relevance decisions as the “document-wide” relevance decision for the document. The significance of interpreting TF-IDF in this way is the potential to: (1) establish a unifying perspective about information retrieval as relevance decision-making; and (2) develop advanced TF-IDF-related term weights for future elaborate retrieval models. Our novel retrieval model is simplified to a basic ranking formula that directly corresponds to the TF-IDF term weights. In…

Citation impact

808
total citations
FWCI
22.76
Percentile
100%
References
86
Citations per year

Authors

4

Topics & keywords

Keywords
  • tf–idf
  • Relevance (law)
  • Term (time)
  • Ranking (information retrieval)
  • Computer science
  • Information retrieval
  • Term Discrimination
  • Vector space model
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding