preprintarXiv (Cornell University)Jun 29, 2019GREEN OA

Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting

University of California, Santa Barbara

Indexed inarxivdatacite

Abstract

Time series forecasting is an important problem across many domains, including predictions of solar plant energy output, electricity consumption, and traffic jam situation. In this paper, we propose to tackle such forecasting problem with Transformer [1]. Although impressed by its performance in our preliminary study, we found its two major weaknesses: (1) locality-agnostics: the point-wise dot-product self-attention in canonical Transformer architecture is insensitive to local context, which can make the model prone to anomalies in time series; (2) memory bottleneck: space complexity of canonical Transformer grows quadratically with sequence length $L$, making directly modeling long time series infeasible. In…

Citation impact

1,008
total citations
FWCI
Percentile
References
35
Citations per year

Authors

7

Topics & keywords

Keywords
  • Computer science
  • Locality
  • Bottleneck
  • Transformer
  • Granularity
  • Time series
  • Quadratic growth
  • Algorithm
UN Sustainable Development Goals
  • Affordable and clean energy
No related works found for this paper.