Abstract

The production environment for analytical data management applications is rapidly changing. Many enterprises are shifting away from deploying their analytical databases on high-end proprietary machines, and moving towards cheaper, lower-end, commodity hardware, typically arranged in a shared-nothing MPP architecture, often in a virtualized environment inside public or private "clouds". At the same time, the amount of data that needs to be analyzed is exploding, requiring hundreds to thousands of machines to work in parallel to perform the analysis. There tend to be two schools of thought regarding what technology to use for data analysis in such an environment. Proponents of parallel databases argue that the…

Citation impact

856
total citations
FWCI
202.60
Percentile
100%
References
9
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Scalability
  • Flexibility (engineering)
  • Fault tolerance
  • Distributed computing
  • Architecture
  • Big data
  • Database
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.