Interpreting TF-IDF term weights as making relevance decisions
Hong Kong Polytechnic University · Chinese University of Hong Kong · +1 more institution
Abstract
A novel probabilistic retrieval model is presented. It forms a basis to interpret the TF-IDF term weights as making relevance decisions. It simulates the local relevance decision-making for every location of a document, and combines all of these “local” relevance decisions as the “document-wide” relevance decision for the document. The significance of interpreting TF-IDF in this way is the potential to: (1) establish a unifying perspective about information retrieval as relevance decision-making; and (2) develop advanced TF-IDF-related term weights for future elaborate retrieval models. Our novel retrieval model is simplified to a basic ranking formula that directly corresponds to the TF-IDF term weights. In…
Citation impact
- FWCI
- 22.76
- Percentile
- 100%
- References
- 86
Authors
4Topics & keywords
- tf–idf
- Relevance (law)
- Term (time)
- Ranking (information retrieval)
- Computer science
- Information retrieval
- Term Discrimination
- Vector space model
- Peace, Justice and strong institutions