TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Are Transformers Effective for Time Series Forecasting?

Are Transformers Effective for Time Series Forecasting?

Ailing Zeng, Muxi Chen, Lei Zhang, Qiang Xu

2022-05-26Relation ExtractionTime Series ForecastingAnomaly DetectionTime SeriesTime Series AnalysisTemporal Relation Extraction
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCode(official)Code

Abstract

Recently, there has been a surge of Transformer-based solutions for the long-term time series forecasting (LTSF) task. Despite the growing performance over the past few years, we question the validity of this line of research in this work. Specifically, Transformers is arguably the most successful solution to extract the semantic correlations among the elements in a long sequence. However, in time series modeling, we are to extract the temporal relations in an ordered set of continuous points. While employing positional encoding and using tokens to embed sub-series in Transformers facilitate preserving some ordering information, the nature of the \emph{permutation-invariant} self-attention mechanism inevitably results in temporal information loss. To validate our claim, we introduce a set of embarrassingly simple one-layer linear models named LTSF-Linear for comparison. Experimental results on nine real-life datasets show that LTSF-Linear surprisingly outperforms existing sophisticated Transformer-based LTSF models in all cases, and often by a large margin. Moreover, we conduct comprehensive empirical studies to explore the impacts of various design elements of LTSF models on their temporal relation extraction capability. We hope this surprising finding opens up new research directions for the LTSF task. We also advocate revisiting the validity of Transformer-based solutions for other time series analysis tasks (e.g., anomaly detection) in the future. Code is available at: \url{https://github.com/cure-lab/LTSF-Linear}.

Results

TaskDatasetMetricValueModel
Time Series ForecastingETTh2 (336) UnivariateMAE0.355NLinear
Time Series ForecastingETTh2 (336) UnivariateMSE0.194NLinear
Time Series ForecastingETTh2 (336) UnivariateMAE0.367DLinear
Time Series ForecastingETTh2 (336) UnivariateMSE0.209DLinear
Time Series ForecastingETTh2 (720) MultivariateMAE0.436NLinear
Time Series ForecastingETTh2 (720) MultivariateMSE0.394NLinear
Time Series ForecastingETTh2 (720) MultivariateMAE0.551DLinear
Time Series ForecastingETTh2 (720) MultivariateMSE0.605DLinear
Time Series ForecastingETTh1 (720) MultivariateMAE0.453NLinear
Time Series ForecastingETTh1 (720) MultivariateMSE0.44NLinear
Time Series ForecastingETTh1 (720) MultivariateMAE0.49DLinear
Time Series ForecastingETTh1 (720) MultivariateMSE0.472DLinear
Time Series ForecastingWeather (192)MSE0.22DLinear
Time Series ForecastingWeather (336)MSE0.265DLinear
Time Series ForecastingElectricity (336)MSE0.169DLinear
Time Series ForecastingWeather (720)MSE0.323DLinear
Time Series ForecastingETTh2 (336) MultivariateMAE0.4NLinear
Time Series ForecastingETTh2 (336) MultivariateMSE0.357NLinear
Time Series ForecastingETTh2 (336) MultivariateMAE0.465DLinear
Time Series ForecastingETTh2 (336) MultivariateMSE0.448DLinear
Time Series ForecastingETTh1 (720) UnivariateMAE0.226NLinear
Time Series ForecastingETTh1 (720) UnivariateMSE0.08NLinear
Time Series ForecastingETTh1 (720) UnivariateMAE0.359DLinear
Time Series ForecastingETTh1 (720) UnivariateMSE0.189DLinear
Time Series ForecastingETTh1 (96) UnivariateMAE0.177NLinear
Time Series ForecastingETTh1 (96) UnivariateMSE0.053NLinear
Time Series ForecastingETTh1 (96) UnivariateMAE0.18DLinear
Time Series ForecastingETTh1 (96) UnivariateMSE0.056DLinear
Time Series ForecastingETTh1 (192) MultivariateMAE0.416DLinear
Time Series ForecastingETTh1 (192) MultivariateMSE0.405DLinear
Time Series ForecastingETTh1 (192) MultivariateMAE0.415NLinear
Time Series ForecastingETTh1 (192) MultivariateMSE0.408NLinear
Time Series ForecastingETTh2 (192) UnivariateMAE0.324NLinear
Time Series ForecastingETTh2 (192) UnivariateMSE0.169NLinear
Time Series ForecastingETTh2 (192) UnivariateMAE0.329DLinear
Time Series ForecastingETTh2 (192) UnivariateMSE0.176DLinear
Time Series ForecastingElectricity (192)MSE0.153DLinear
Time Series ForecastingETTh1 (192) UnivariateMAE0.204DLinear
Time Series ForecastingETTh1 (192) UnivariateMSE0.071DLinear
Time Series ForecastingETTh1 (336) MultivariateMAE0.427NLinear
Time Series ForecastingETTh1 (336) MultivariateMSE0.429NLinear
Time Series ForecastingETTh1 (336) MultivariateMAE0.443DLinear
Time Series ForecastingETTh1 (336) MultivariateMSE0.439DLinear
Time Series ForecastingETTh2 (96) MultivariateMAE0.338NLinear
Time Series ForecastingETTh2 (96) MultivariateMSE0.277NLinear
Time Series ForecastingETTh2 (96) MultivariateMAE0.353DLinear
Time Series ForecastingETTh2 (96) MultivariateMSE0.289DLinear
Time Series ForecastingWeather (96)MSE0.176DLinear
Time Series ForecastingETTh2 (720) UnivariateMAE0.381NLinear
Time Series ForecastingETTh2 (720) UnivariateMSE0.225NLinear
Time Series ForecastingETTh2 (720) UnivariateMAE0.426DLinear
Time Series ForecastingETTh2 (720) UnivariateMSE0.276DLinear
Time Series ForecastingETTh1 (336) UnivariateMAE0.226NLinear
Time Series ForecastingETTh1 (336) UnivariateMSE0.081NLinear
Time Series ForecastingETTh1 (336) UnivariateMAE0.244DLinear
Time Series ForecastingETTh1 (336) UnivariateMSE0.098DLinear
Time Series ForecastingElectricity (96)MSE0.14DLinear
Time Series ForecastingETTh2 (192) MultivariateMAE0.381NLinear
Time Series ForecastingETTh2 (192) MultivariateMSE0.344NLinear
Time Series ForecastingETTh2 (192) MultivariateMAE0.418DLinear
Time Series ForecastingETTh2 (192) MultivariateMSE0.383DLinear
Time Series ForecastingETTh2 (96) UnivariateMAE0.278NLinear
Time Series ForecastingETTh2 (96) UnivariateMSE0.129NLinear
Time Series ForecastingETTh2 (96) UnivariateMAE0.279DLinear
Time Series ForecastingETTh2 (96) UnivariateMSE0.131DLinear
Time Series ForecastingElectricity (720)MSE0.203DLinear
Time Series AnalysisETTh2 (336) UnivariateMAE0.355NLinear
Time Series AnalysisETTh2 (336) UnivariateMSE0.194NLinear
Time Series AnalysisETTh2 (336) UnivariateMAE0.367DLinear
Time Series AnalysisETTh2 (336) UnivariateMSE0.209DLinear
Time Series AnalysisETTh2 (720) MultivariateMAE0.436NLinear
Time Series AnalysisETTh2 (720) MultivariateMSE0.394NLinear
Time Series AnalysisETTh2 (720) MultivariateMAE0.551DLinear
Time Series AnalysisETTh2 (720) MultivariateMSE0.605DLinear
Time Series AnalysisETTh1 (720) MultivariateMAE0.453NLinear
Time Series AnalysisETTh1 (720) MultivariateMSE0.44NLinear
Time Series AnalysisETTh1 (720) MultivariateMAE0.49DLinear
Time Series AnalysisETTh1 (720) MultivariateMSE0.472DLinear
Time Series AnalysisWeather (192)MSE0.22DLinear
Time Series AnalysisWeather (336)MSE0.265DLinear
Time Series AnalysisElectricity (336)MSE0.169DLinear
Time Series AnalysisWeather (720)MSE0.323DLinear
Time Series AnalysisETTh2 (336) MultivariateMAE0.4NLinear
Time Series AnalysisETTh2 (336) MultivariateMSE0.357NLinear
Time Series AnalysisETTh2 (336) MultivariateMAE0.465DLinear
Time Series AnalysisETTh2 (336) MultivariateMSE0.448DLinear
Time Series AnalysisETTh1 (720) UnivariateMAE0.226NLinear
Time Series AnalysisETTh1 (720) UnivariateMSE0.08NLinear
Time Series AnalysisETTh1 (720) UnivariateMAE0.359DLinear
Time Series AnalysisETTh1 (720) UnivariateMSE0.189DLinear
Time Series AnalysisETTh1 (96) UnivariateMAE0.177NLinear
Time Series AnalysisETTh1 (96) UnivariateMSE0.053NLinear
Time Series AnalysisETTh1 (96) UnivariateMAE0.18DLinear
Time Series AnalysisETTh1 (96) UnivariateMSE0.056DLinear
Time Series AnalysisETTh1 (192) MultivariateMAE0.416DLinear
Time Series AnalysisETTh1 (192) MultivariateMSE0.405DLinear
Time Series AnalysisETTh1 (192) MultivariateMAE0.415NLinear
Time Series AnalysisETTh1 (192) MultivariateMSE0.408NLinear
Time Series AnalysisETTh2 (192) UnivariateMAE0.324NLinear
Time Series AnalysisETTh2 (192) UnivariateMSE0.169NLinear
Time Series AnalysisETTh2 (192) UnivariateMAE0.329DLinear
Time Series AnalysisETTh2 (192) UnivariateMSE0.176DLinear
Time Series AnalysisElectricity (192)MSE0.153DLinear
Time Series AnalysisETTh1 (192) UnivariateMAE0.204DLinear
Time Series AnalysisETTh1 (192) UnivariateMSE0.071DLinear
Time Series AnalysisETTh1 (336) MultivariateMAE0.427NLinear
Time Series AnalysisETTh1 (336) MultivariateMSE0.429NLinear
Time Series AnalysisETTh1 (336) MultivariateMAE0.443DLinear
Time Series AnalysisETTh1 (336) MultivariateMSE0.439DLinear
Time Series AnalysisETTh2 (96) MultivariateMAE0.338NLinear
Time Series AnalysisETTh2 (96) MultivariateMSE0.277NLinear
Time Series AnalysisETTh2 (96) MultivariateMAE0.353DLinear
Time Series AnalysisETTh2 (96) MultivariateMSE0.289DLinear
Time Series AnalysisWeather (96)MSE0.176DLinear
Time Series AnalysisETTh2 (720) UnivariateMAE0.381NLinear
Time Series AnalysisETTh2 (720) UnivariateMSE0.225NLinear
Time Series AnalysisETTh2 (720) UnivariateMAE0.426DLinear
Time Series AnalysisETTh2 (720) UnivariateMSE0.276DLinear
Time Series AnalysisETTh1 (336) UnivariateMAE0.226NLinear
Time Series AnalysisETTh1 (336) UnivariateMSE0.081NLinear
Time Series AnalysisETTh1 (336) UnivariateMAE0.244DLinear
Time Series AnalysisETTh1 (336) UnivariateMSE0.098DLinear
Time Series AnalysisElectricity (96)MSE0.14DLinear
Time Series AnalysisETTh2 (192) MultivariateMAE0.381NLinear
Time Series AnalysisETTh2 (192) MultivariateMSE0.344NLinear
Time Series AnalysisETTh2 (192) MultivariateMAE0.418DLinear
Time Series AnalysisETTh2 (192) MultivariateMSE0.383DLinear
Time Series AnalysisETTh2 (96) UnivariateMAE0.278NLinear
Time Series AnalysisETTh2 (96) UnivariateMSE0.129NLinear
Time Series AnalysisETTh2 (96) UnivariateMAE0.279DLinear
Time Series AnalysisETTh2 (96) UnivariateMSE0.131DLinear
Time Series AnalysisElectricity (720)MSE0.203DLinear

Related Papers

Multi-Stage Prompt Inference Attacks on Enterprise LLM Systems2025-07-21The Power of Architecture: Deep Dive into Transformer Architectures for Long-Term Time Series Forecasting2025-07-173DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering2025-07-17A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17Emergence of Functionally Differentiated Structures via Mutual Information Optimization in Recurrent Neural Networks2025-07-17A Privacy-Preserving Framework for Advertising Personalization Incorporating Federated Learning and Differential Privacy2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15