articleAug 4, 2023GOLD OA

TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting

IBM Research - India · IBM (United States)

Indexed inarxivcrossref

Abstract

Transformers have gained popularity in time series forecasting for their ability to capture long-sequence interactions. However, their memory and compute-intensive requirements pose a critical bottleneck for long-term forecasting, despite numerous advancements in compute-aware self-attention modules. To address this, we propose TSMixer, a lightweight neural architecture exclusively composed of multi-layer perceptron (MLP) modules. TSMixer is designed for multivariate forecasting and representation learning on patched time series, providing an efficient alternative to Transformers. Our model draws inspiration from the success of MLP-Mixer models in computer vision. We demonstrate the challenges involved in…

Citation impact

221
total citations
FWCI
52.95
Percentile
100%
References
40
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Machine learning
  • Transformer
  • Modular design
  • Multilayer perceptron
  • Perceptron
  • Artificial neural network
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.