TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting
IBM Research - India · IBM (United States)
Abstract
Transformers have gained popularity in time series forecasting for their ability to capture long-sequence interactions. However, their memory and compute-intensive requirements pose a critical bottleneck for long-term forecasting, despite numerous advancements in compute-aware self-attention modules. To address this, we propose TSMixer, a lightweight neural architecture exclusively composed of multi-layer perceptron (MLP) modules. TSMixer is designed for multivariate forecasting and representation learning on patched time series, providing an efficient alternative to Transformers. Our model draws inspiration from the success of MLP-Mixer models in computer vision. We demonstrate the challenges involved in…
Citation impact
- FWCI
- 52.95
- Percentile
- 100%
- References
- 40
Authors
5Topics & keywords
- Computer science
- Artificial intelligence
- Machine learning
- Transformer
- Modular design
- Multilayer perceptron
- Perceptron
- Artificial neural network
- Industry, innovation and infrastructure