Article · Dec 1, 2016 · Closed access

Matrix Profile I: All Pairs Similarity Joins for Time Series: A Unifying View That Includes Motifs, Discords and Shapelets

University of California System · Universidade de São Paulo

Indexed in Crossref

Abstract

The all-pairs-similarity-search (or similarity join) problem has been extensively studied for text and a handful of other datatypes. However, surprisingly little progress has been made on similarity joins for time series subsequences. The lack of progress probably stems from the daunting nature of the problem. For even modest sized datasets the obvious nested-loop algorithm can take months, and the typical speed-up techniques in this domain (i.e., indexing, lower-bounding, triangular-inequality pruning and early abandoning) at best produce one or two orders of magnitude speedup. In this work we introduce a novel scalable algorithm for time series subsequence all-pairs-similarity-search. For exceptionally large…
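The "obvious nested-loop algorithm" mentioned in the abstract can be sketched as follows. This is only an illustration of the brute-force baseline, not the scalable algorithm the paper introduces. The function names `znorm` and `naive_self_join`, the use of z-normalized Euclidean distance, and the exclusion zone of half the subsequence length are assumptions made for this sketch and are not taken from the text above.

```python
import math


def znorm(seq):
    """Z-normalize a sequence; a constant sequence maps to all zeros."""
    n = len(seq)
    mean = sum(seq) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in seq) / n)
    if std == 0:
        return [0.0] * n
    return [(x - mean) / std for x in seq]


def naive_self_join(ts, m):
    """Brute-force nearest-neighbor self-join over all length-m subsequences.

    Returns (dists, idxs): for each subsequence start i, the distance to and
    the start index of its nearest non-trivial neighbor.
    """
    n_subs = len(ts) - m + 1
    # Assumption: skip neighbors closer than m // 2 positions, since a
    # subsequence trivially matches itself and its heavy overlaps.
    excl = max(1, m // 2)
    subs = [znorm(ts[i:i + m]) for i in range(n_subs)]
    dists = [math.inf] * n_subs
    idxs = [-1] * n_subs
    for i in range(n_subs):          # outer loop: every query subsequence
        for j in range(n_subs):      # inner loop: every candidate neighbor
            if abs(i - j) < excl:
                continue
            d = math.sqrt(sum((a - b) ** 2 for a, b in zip(subs[i], subs[j])))
            if d < dists[i]:
                dists[i], idxs[i] = d, j
    return dists, idxs


if __name__ == "__main__":
    # The pattern 0, 1, 3, 1, 0 appears at the start and at the end.
    series = [0, 1, 3, 1, 0, 5, 2, 7, 0, 1, 3, 1, 0]
    dists, idxs = naive_self_join(series, m=4)
    for i, (d, j) in enumerate(zip(dists, idxs)):
        print(i, j, round(d, 3))
```

For a series of length n and subsequence length m, the two loops perform roughly n² distance computations of cost m each, which is why this baseline becomes impractical even for modestly sized datasets.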

Citation impact

  • Total citations: 617
  • FWCI: 29.09
  • Percentile: 100%
  • References: 28

Authors

9 authors

Topics & keywords

Keywords
  • Joins
  • Computer science
  • Search engine indexing
  • Scalability
  • Subsequence
  • Longest common subsequence problem
  • Series (stratigraphy)
  • Nearest neighbor search