Article · Dec 1, 2010 · Closed access

S4: Distributed Stream Computing Platform

Yahoo (United States)

Indexed in Crossref

Abstract

S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data. Keyed data events are routed with affinity to Processing Elements (PEs), which consume the events and do one or both of the following: (1) emit one or more events which may be consumed by other PEs, (2) publish results. The architecture resembles the Actors model, providing semantics of encapsulation and location transparency, thus allowing applications to be massively concurrent while exposing a simple programming interface to application developers. In this paper, we outline the S4 architecture in detail,…
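The keyed routing described in the abstract can be illustrated with a minimal single-process sketch. It is not S4's API, which this record does not show: `Event`, `WordCountPE`, `Node`, `Cluster`, and `run_word_count` are hypothetical names, and the CRC32-modulo routing rule is an assumption chosen only to make "routed with affinity" concrete. Each distinct key gets its own PE instance on exactly one node, and every event carrying that key reaches the same instance.

```python
from dataclasses import dataclass
from zlib import crc32


@dataclass(frozen=True)
class Event:
    stream: str  # hypothetical stream name
    key: str     # routing key: events with equal keys reach the same PE
    value: int


class WordCountPE:
    """Hypothetical PE: one instance per key, holding a running count."""

    def __init__(self, key):
        self.key = key
        self.count = 0

    def process(self, event):
        self.count += event.value
        # (1) emit an event that other PEs could consume
        return [Event("counts", self.key, self.count)]


class Node:
    """Holds the PE instances for the keys routed to this node."""

    def __init__(self):
        self.pes = {}

    def dispatch(self, event):
        # Create the PE lazily the first time its key is seen.
        if event.key not in self.pes:
            self.pes[event.key] = WordCountPE(event.key)
        return self.pes[event.key].process(event)


class Cluster:
    def __init__(self, num_nodes):
        self.nodes = [Node() for _ in range(num_nodes)]
        self.published = {}

    def node_for(self, key):
        # Assumed routing rule: a stable hash of the key picks the node.
        return crc32(key.encode("utf-8")) % len(self.nodes)

    def send(self, event):
        node = self.nodes[self.node_for(event.key)]
        for out in node.dispatch(event):
            # (2) publish results. This sketch records emitted events
            # directly instead of routing them to downstream PEs.
            self.published[out.key] = out.value


def run_word_count(words, num_nodes=3):
    cluster = Cluster(num_nodes)
    for word in words:
        cluster.send(Event("words", word, 1))
    return cluster
```

Calling `run_word_count(["a", "b", "a"])` leaves one `WordCountPE` per distinct word, each living on a single node. State stays encapsulated inside the PE and the sender never needs to know which node holds it, which is the Actors-like property the abstract points to.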

Citation impact

  • Total citations: 905
  • FWCI: 59.13
  • Percentile: 100%
  • References: 17

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Scalability
  • Stream processing
  • Distributed computing
  • Data stream mining
  • Architecture
  • Massively parallel
  • Encapsulation (networking)