Abstract

This paper describes the use of Storm at Twitter. Storm is a real-time fault-tolerant and distributed stream data processing system. Storm is currently being used to run various critical computations in Twitter at scale, and in real-time. This paper describes the architecture of Storm and its methods for distributed scale-out and fault-tolerance. This paper also describes how queries (aka. topologies) are executed in Storm, and presents some operational stories based on running Storm at Twitter. We also present results from an empirical evaluation demonstrating the resilience of Storm in dealing with machine failures. Storm is under active development at Twitter and we also present some potential directions…

Citation impact

856
total citations
FWCI
264.84
Percentile
100%
References
11
Citations per year

Authors

13

Topics & keywords

Keywords
  • Storm
  • Computer science
  • Scale (ratio)
  • Resilience (materials science)
  • AKA
  • Architecture
  • Stream processing
  • Fault tolerance
UN Sustainable Development Goals
  • Climate action
No related works found for this paper.