Abstract

Storm has long served as the main platform for real-time analytics at Twitter. However, as the scale of data being processed in real-time at Twitter has increased, along with an increase in the diversity and the number of use cases, many limitations of Storm have become apparent. We need a system that scales better, has better debug-ability, has better performance, and is easier to manage -- all while working in a shared cluster infrastructure. We considered various alternatives to meet these needs, and in the end concluded that we needed to build a new real-time stream data processing system. This paper presents the design and implementation of this new system, called Heron. Heron is now the de facto stream…

Citation impact

547
total citations
FWCI
79.94
Percentile
100%
References
11
Citations per year

Authors

9

Topics & keywords

Keywords
  • Heron
  • Computer science
  • Scalability
  • Stream processing
  • Analytics
  • Debugging
  • Database
  • Distributed computing
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.