TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Informer: Beyond Efficient Transformer for Long Sequence T...

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, JianXin Li, Hui Xiong, Wancai Zhang

2020-12-14Time Series ForecastingUnivariate Time Series ForecastingTime SeriesMultivariate Time Series ForecastingTime Series Analysis
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCode

Abstract

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a high prediction capacity of the model, which is the ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown the potential of Transformer to increase the prediction capacity. However, there are several severe issues with Transformer that prevent it from being directly applicable to LSTF, including quadratic time complexity, high memory usage, and inherent limitation of the encoder-decoder architecture. To address these issues, we design an efficient transformer-based model for LSTF, named Informer, with three distinctive characteristics: (i) a $ProbSparse$ self-attention mechanism, which achieves $O(L \log L)$ in time complexity and memory usage, and has comparable performance on sequences' dependency alignment. (ii) the self-attention distilling highlights dominating attention by halving cascading layer input, and efficiently handles extreme long input sequences. (iii) the generative style decoder, while conceptually simple, predicts the long time-series sequences at one forward operation rather than a step-by-step way, which drastically improves the inference speed of long-sequence predictions. Extensive experiments on four large-scale datasets demonstrate that Informer significantly outperforms existing methods and provides a new solution to the LSTF problem.

Results

TaskDatasetMetricValueModel
Time Series ForecastingETTh2 (336) UnivariateMAE0.323Informer
Time Series ForecastingETTh2 (336) UnivariateMSE0.166Informer
Time Series ForecastingETTh2 (168) UnivariateMAE0.306Informer
Time Series ForecastingETTh2 (168) UnivariateMSE0.154Informer
Time Series ForecastingETTh1 (24) MultivariateMAE0.523Informer
Time Series ForecastingETTh1 (24) MultivariateMSE0.509Informer
Time Series ForecastingETTh2 (168) MultivariateMAE0.996Informer
Time Series ForecastingETTh2 (168) MultivariateMSE1.512Informer
Time Series ForecastingETTh1 (24) UnivariateMAE0.152Informer
Time Series ForecastingETTh1 (24) UnivariateMSE0.046Informer
Time Series ForecastingETTh2 (720) MultivariateMAE1.209Informer
Time Series ForecastingETTh2 (720) MultivariateMSE2.34Informer
Time Series ForecastingETTh1 (720) MultivariateMAE0.768Informer
Time Series ForecastingETTh1 (720) MultivariateMSE0.941Informer
Time Series ForecastingETTh2 (336) MultivariateMAE1.035Informer
Time Series ForecastingETTh2 (336) MultivariateMSE1.665Informer
Time Series ForecastingETTh1 (720) UnivariateMAE0.357Informer
Time Series ForecastingETTh1 (720) UnivariateMSE0.201Informer
Time Series ForecastingETTh2 (24) MultivariateMAE0.523Informer
Time Series ForecastingETTh2 (24) MultivariateMSE0.446Informer
Time Series ForecastingETTh2 (24) UnivariateMAE0.213Informer
Time Series ForecastingETTh2 (24) UnivariateMSE0.083Informer
Time Series ForecastingETTh1 (336) MultivariateMAE0.753Informer
Time Series ForecastingETTh1 (336) MultivariateMSE0.884Informer
Time Series ForecastingETTh1 (168) MultivariateMAE0.722Informer
Time Series ForecastingETTh1 (168) MultivariateMSE0.878Informer
Time Series ForecastingETTh1 (168) UnivariateMAE0.337Informer
Time Series ForecastingETTh1 (168) UnivariateMSE0.183Informer
Time Series ForecastingETTh1 (48) MultivariateMAE0.563Informer
Time Series ForecastingETTh1 (48) MultivariateMSE0.551Informer
Time Series ForecastingETTh2 (720) UnivariateMAE0.338Informer
Time Series ForecastingETTh2 (720) UnivariateMSE0.181Informer
Time Series ForecastingETTh1 (48) UnivariateMAE0.274Informer
Time Series ForecastingETTh1 (48) UnivariateMSE0.129Informer
Time Series ForecastingETTh1 (336) UnivariateMAE0.346Informer
Time Series ForecastingETTh1 (336) UnivariateMSE0.189Informer
Time Series ForecastingETTh2 (48) MultivariateMAE0.733Informer
Time Series ForecastingETTh2 (48) MultivariateMSE0.934Informer
Time Series ForecastingETTh2 (48) UnivariateMAE0.249Informer
Time Series ForecastingETTh2 (48) UnivariateMSE0.111Informer
Time Series AnalysisETTh2 (336) UnivariateMAE0.323Informer
Time Series AnalysisETTh2 (336) UnivariateMSE0.166Informer
Time Series AnalysisETTh2 (168) UnivariateMAE0.306Informer
Time Series AnalysisETTh2 (168) UnivariateMSE0.154Informer
Time Series AnalysisETTh1 (24) MultivariateMAE0.523Informer
Time Series AnalysisETTh1 (24) MultivariateMSE0.509Informer
Time Series AnalysisETTh2 (168) MultivariateMAE0.996Informer
Time Series AnalysisETTh2 (168) MultivariateMSE1.512Informer
Time Series AnalysisETTh1 (24) UnivariateMAE0.152Informer
Time Series AnalysisETTh1 (24) UnivariateMSE0.046Informer
Time Series AnalysisETTh2 (720) MultivariateMAE1.209Informer
Time Series AnalysisETTh2 (720) MultivariateMSE2.34Informer
Time Series AnalysisETTh1 (720) MultivariateMAE0.768Informer
Time Series AnalysisETTh1 (720) MultivariateMSE0.941Informer
Time Series AnalysisETTh2 (336) MultivariateMAE1.035Informer
Time Series AnalysisETTh2 (336) MultivariateMSE1.665Informer
Time Series AnalysisETTh1 (720) UnivariateMAE0.357Informer
Time Series AnalysisETTh1 (720) UnivariateMSE0.201Informer
Time Series AnalysisETTh2 (24) MultivariateMAE0.523Informer
Time Series AnalysisETTh2 (24) MultivariateMSE0.446Informer
Time Series AnalysisETTh2 (24) UnivariateMAE0.213Informer
Time Series AnalysisETTh2 (24) UnivariateMSE0.083Informer
Time Series AnalysisETTh1 (336) MultivariateMAE0.753Informer
Time Series AnalysisETTh1 (336) MultivariateMSE0.884Informer
Time Series AnalysisETTh1 (168) MultivariateMAE0.722Informer
Time Series AnalysisETTh1 (168) MultivariateMSE0.878Informer
Time Series AnalysisETTh1 (168) UnivariateMAE0.337Informer
Time Series AnalysisETTh1 (168) UnivariateMSE0.183Informer
Time Series AnalysisETTh1 (48) MultivariateMAE0.563Informer
Time Series AnalysisETTh1 (48) MultivariateMSE0.551Informer
Time Series AnalysisETTh2 (720) UnivariateMAE0.338Informer
Time Series AnalysisETTh2 (720) UnivariateMSE0.181Informer
Time Series AnalysisETTh1 (48) UnivariateMAE0.274Informer
Time Series AnalysisETTh1 (48) UnivariateMSE0.129Informer
Time Series AnalysisETTh1 (336) UnivariateMAE0.346Informer
Time Series AnalysisETTh1 (336) UnivariateMSE0.189Informer
Time Series AnalysisETTh2 (48) MultivariateMAE0.733Informer
Time Series AnalysisETTh2 (48) MultivariateMSE0.934Informer
Time Series AnalysisETTh2 (48) UnivariateMAE0.249Informer
Time Series AnalysisETTh2 (48) UnivariateMSE0.111Informer

Related Papers

The Power of Architecture: Deep Dive into Transformer Architectures for Long-Term Time Series Forecasting2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17Emergence of Functionally Differentiated Structures via Mutual Information Optimization in Recurrent Neural Networks2025-07-17Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15D3FL: Data Distribution and Detrending for Robust Federated Learning in Non-linear Time-series Data2025-07-15Towards Interpretable Time Series Foundation Models2025-07-10MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models2025-07-09Foundation models for time series forecasting: Application in conformal prediction2025-07-09