Article · ACM SIGIR Forum · Aug 2, 2017 · Closed access

Scatter/Gather

Palo Alto Research Center · Stanford University · +1 more institution

Indexed in Crossref

Abstract

Document clustering has not been well received as an information retrieval tool. Objections to its use fall into two main categories: first, that clustering is too slow for large corpora (with running time often quadratic in the number of documents); and second, that clustering does not appreciably improve retrieval. We argue that these problems arise only when clustering is used in an attempt to improve conventional search techniques. However, looking at clustering as an information access tool in its own right obviates these objections, and provides a powerful new access paradigm. We present a document browsing technique that employs document clustering as its primary operation. We also present fast (linear…

Citation impact

  • Total citations: 1,425
  • FWCI: 22.06
  • Percentile: 100%
  • References: 13

Authors

4

Topics & keywords

Keywords
  • Cluster analysis
  • Computer science
  • Document clustering
  • Information retrieval
  • Data mining
  • Correlation clustering
  • Clustering high-dimensional data
  • Fuzzy clustering
UN Sustainable Development Goals
  • Quality Education