articleJul 23, 2002Closed access

Bursty and hierarchical structure in streams

Cornell University

Indexed incrossref

Abstract

A fundamental problem in text data mining is to extract meaningful structure from document streams that arrive continuously over time. E-mail and news articles are two natural examples of such streams, each characterized by topics that appear, grow in intensity for a period of time, and then fade away. The published literature in a particular research field can be seen to exhibit similar phenomena over a much longer time scale. Underlying much of the text mining work in this area is the following intuitive premise --- that the appearance of a topic in a document stream is signaled by a "burst of activity," with certain features rising sharply in frequency as the topic emerges.The goal of the present work is to…

Citation impact

1,136
total citations
FWCI
17.97
Percentile
100%
References
67
Citations per year

Authors

1

Topics & keywords

Keywords
  • Computer science
  • Premise
  • Representation (politics)
  • Data stream mining
  • Set (abstract data type)
  • Field (mathematics)
  • Analogy
  • Topic model
No related works found for this paper.