TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Mixture-of-Linear-Experts for Long-term Time Series Foreca...

Mixture-of-Linear-Experts for Long-term Time Series Forecasting

Ronghao Ni, Zinan Lin, Shuaiqi Wang, Giulia Fanti

2023-12-11Time Series ForecastingTime Series
PaperPDFCode(official)

Abstract

Long-term time series forecasting (LTSF) aims to predict future values of a time series given the past values. The current state-of-the-art (SOTA) on this problem is attained in some cases by linear-centric models, which primarily feature a linear mapping layer. However, due to their inherent simplicity, they are not able to adapt their prediction rules to periodic changes in time series patterns. To address this challenge, we propose a Mixture-of-Experts-style augmentation for linear-centric models and propose Mixture-of-Linear-Experts (MoLE). Instead of training a single model, MoLE trains multiple linear-centric models (i.e., experts) and a router model that weighs and mixes their outputs. While the entire framework is trained end-to-end, each expert learns to specialize in a specific temporal pattern, and the router model learns to compose the experts adaptively. Experiments show that MoLE reduces forecasting error of linear-centric models, including DLinear, RLinear, and RMLP, in over 78% of the datasets and settings we evaluated. By using MoLE existing linear-centric models can achieve SOTA LTSF results in 68% of the experiments that PatchTST reports and we compare to, whereas existing single-head linear-centric models achieve SOTA results in only 25% of cases.

Results

TaskDatasetMetricValueModel
Time Series ForecastingWeather2K114 (96)MSE0.391MoLE-DLinear
Time Series ForecastingWeather2K79 (336)MSE0.546MoLE-DLinear
Time Series ForecastingETTh2 (720) MultivariateMSE0.409MoLE-RLinear
Time Series ForecastingETTh2 (720) MultivariateMSE0.605MoLE-DLinear
Time Series ForecastingETTh1 (720) MultivariateMSE0.449MoLE-RLinear
Time Series ForecastingETTh1 (720) MultivariateMSE0.505MoLE-DLinear
Time Series ForecastingWeather2K114 (720)MSE0.425MoLE-DLinear
Time Series ForecastingETTm1 (192) MultivariateMSE0.328MoLE-DLinear
Time Series ForecastingWeather (192)MSE0.19MoLE-RMLP
Time Series ForecastingWeather (192)MSE0.203MoLE-DLinear
Time Series ForecastingWeather (336)MSE0.238MoLE-DLinear
Time Series ForecastingETTm2 (96) MultivariateMSE0.168MoLE-DLinear
Time Series ForecastingElectricity (336)MSE0.162MoLE-DLinear
Time Series ForecastingWeather (720)MSE0.314MoLE-DLinear
Time Series ForecastingETTh2 (336) MultivariateMSE0.371MoLE-RLinear
Time Series ForecastingETTh2 (336) MultivariateMSE0.419MoLE-DLinear
Time Series ForecastingWeather2K850 (192)MSE0.484MoLE-DLinear
Time Series ForecastingETTh1 (192) MultivariateMSE0.403MoLE-RLinear
Time Series ForecastingETTh1 (192) MultivariateMSE0.453MoLE-DLinear
Time Series ForecastingWeather2K850 (720)MSE0.461MoLE-DLinear
Time Series ForecastingElectricity (192)MSE0.147MoLE-DLinear
Time Series ForecastingETTh1 (336) MultivariateMSE0.43MoLE-RLinear
Time Series ForecastingETTh1 (336) MultivariateMSE0.469MoLE-DLinear
Time Series ForecastingWeather2K850 (336)MSE0.474MoLE-DLinear
Time Series ForecastingETTm2 (336) MultivariateMSE0.289MoLE-DLinear
Time Series ForecastingETTm1 (336) MultivariateMSE0.38MoLE-DLinear
Time Series ForecastingETTm1 (96) MultivariateMSE0.286MoLE-DLinear
Time Series ForecastingETTm1 (720) MultivariateMSE0.447MoLE-DLinear
Time Series ForecastingETTh2 (96) MultivariateMSE0.273MoLE-RLinear
Time Series ForecastingETTh2 (96) MultivariateMSE0.287MoLE-DLinear
Time Series ForecastingWeather2K79 (96)MSE0.555MoLE-DLinear
Time Series ForecastingWeather2K79 (720)MSE0.535MoLE-DLinear
Time Series ForecastingWeather2K1786 (192)MSE0.581MoLE-RLinear
Time Series ForecastingWeather2K1786 (192)MSE0.601MoLE-DLinear
Time Series ForecastingWeather2K850 (96)MSE0.471MoLE-RLinear
Time Series ForecastingWeather2K850 (96)MSE0.474MoLE-DLinear
Time Series ForecastingWeather (96)MSE0.147MoLE-DLinear
Time Series ForecastingETTm2 (192) MultivariateMSE0.233MoLE-DLinear
Time Series ForecastingETTh1 (96) MultivariateMSE0.375MoLE-RLinear
Time Series ForecastingETTh1 (96) MultivariateMSE0.377MoLE-DLinear
Time Series ForecastingWeather2K114 (192)MSE0.405MoLE-DLinear
Time Series ForecastingElectricity (96)MSE0.129MoLE-RMLP
Time Series ForecastingElectricity (96)MSE0.131MoLE-DLinear
Time Series ForecastingWeather2K114 (336)MSE0.415MoLE-DLinear
Time Series ForecastingWeather2K1786 (336)MSE0.603MoLE-DLinear
Time Series ForecastingETTh2 (192) MultivariateMSE0.336MoLE-RLinear
Time Series ForecastingETTh2 (192) MultivariateMSE0.362MoLE-DLinear
Time Series ForecastingWeather2K1786 (720)MSE0.628MoLE-RLinear
Time Series ForecastingWeather2K1786 (720)MSE0.66MoLE-DLinear
Time Series ForecastingWeather2K1786 (96)MSE0.535MoLE-DLinear
Time Series ForecastingWeather2K1786 (96)MSE0.535MoLE-RLinear
Time Series ForecastingWeather2K79 (192)MSE0.566MoLE-DLinear
Time Series ForecastingElectricity (720)MSE0.178MoLE-RMLP
Time Series ForecastingElectricity (720)MSE0.18MoLE-DLinear
Time Series ForecastingETTm2 (720) MultivariateMSE0.399MoLE-DLinear
Time Series AnalysisWeather2K114 (96)MSE0.391MoLE-DLinear
Time Series AnalysisWeather2K79 (336)MSE0.546MoLE-DLinear
Time Series AnalysisETTh2 (720) MultivariateMSE0.409MoLE-RLinear
Time Series AnalysisETTh2 (720) MultivariateMSE0.605MoLE-DLinear
Time Series AnalysisETTh1 (720) MultivariateMSE0.449MoLE-RLinear
Time Series AnalysisETTh1 (720) MultivariateMSE0.505MoLE-DLinear
Time Series AnalysisWeather2K114 (720)MSE0.425MoLE-DLinear
Time Series AnalysisETTm1 (192) MultivariateMSE0.328MoLE-DLinear
Time Series AnalysisWeather (192)MSE0.19MoLE-RMLP
Time Series AnalysisWeather (192)MSE0.203MoLE-DLinear
Time Series AnalysisWeather (336)MSE0.238MoLE-DLinear
Time Series AnalysisETTm2 (96) MultivariateMSE0.168MoLE-DLinear
Time Series AnalysisElectricity (336)MSE0.162MoLE-DLinear
Time Series AnalysisWeather (720)MSE0.314MoLE-DLinear
Time Series AnalysisETTh2 (336) MultivariateMSE0.371MoLE-RLinear
Time Series AnalysisETTh2 (336) MultivariateMSE0.419MoLE-DLinear
Time Series AnalysisWeather2K850 (192)MSE0.484MoLE-DLinear
Time Series AnalysisETTh1 (192) MultivariateMSE0.403MoLE-RLinear
Time Series AnalysisETTh1 (192) MultivariateMSE0.453MoLE-DLinear
Time Series AnalysisWeather2K850 (720)MSE0.461MoLE-DLinear
Time Series AnalysisElectricity (192)MSE0.147MoLE-DLinear
Time Series AnalysisETTh1 (336) MultivariateMSE0.43MoLE-RLinear
Time Series AnalysisETTh1 (336) MultivariateMSE0.469MoLE-DLinear
Time Series AnalysisWeather2K850 (336)MSE0.474MoLE-DLinear
Time Series AnalysisETTm2 (336) MultivariateMSE0.289MoLE-DLinear
Time Series AnalysisETTm1 (336) MultivariateMSE0.38MoLE-DLinear
Time Series AnalysisETTm1 (96) MultivariateMSE0.286MoLE-DLinear
Time Series AnalysisETTm1 (720) MultivariateMSE0.447MoLE-DLinear
Time Series AnalysisETTh2 (96) MultivariateMSE0.273MoLE-RLinear
Time Series AnalysisETTh2 (96) MultivariateMSE0.287MoLE-DLinear
Time Series AnalysisWeather2K79 (96)MSE0.555MoLE-DLinear
Time Series AnalysisWeather2K79 (720)MSE0.535MoLE-DLinear
Time Series AnalysisWeather2K1786 (192)MSE0.581MoLE-RLinear
Time Series AnalysisWeather2K1786 (192)MSE0.601MoLE-DLinear
Time Series AnalysisWeather2K850 (96)MSE0.471MoLE-RLinear
Time Series AnalysisWeather2K850 (96)MSE0.474MoLE-DLinear
Time Series AnalysisWeather (96)MSE0.147MoLE-DLinear
Time Series AnalysisETTm2 (192) MultivariateMSE0.233MoLE-DLinear
Time Series AnalysisETTh1 (96) MultivariateMSE0.375MoLE-RLinear
Time Series AnalysisETTh1 (96) MultivariateMSE0.377MoLE-DLinear
Time Series AnalysisWeather2K114 (192)MSE0.405MoLE-DLinear
Time Series AnalysisElectricity (96)MSE0.129MoLE-RMLP
Time Series AnalysisElectricity (96)MSE0.131MoLE-DLinear
Time Series AnalysisWeather2K114 (336)MSE0.415MoLE-DLinear
Time Series AnalysisWeather2K1786 (336)MSE0.603MoLE-DLinear
Time Series AnalysisETTh2 (192) MultivariateMSE0.336MoLE-RLinear
Time Series AnalysisETTh2 (192) MultivariateMSE0.362MoLE-DLinear
Time Series AnalysisWeather2K1786 (720)MSE0.628MoLE-RLinear
Time Series AnalysisWeather2K1786 (720)MSE0.66MoLE-DLinear
Time Series AnalysisWeather2K1786 (96)MSE0.535MoLE-DLinear
Time Series AnalysisWeather2K1786 (96)MSE0.535MoLE-RLinear
Time Series AnalysisWeather2K79 (192)MSE0.566MoLE-DLinear
Time Series AnalysisElectricity (720)MSE0.178MoLE-RMLP
Time Series AnalysisElectricity (720)MSE0.18MoLE-DLinear
Time Series AnalysisETTm2 (720) MultivariateMSE0.399MoLE-DLinear

Related Papers

The Power of Architecture: Deep Dive into Transformer Architectures for Long-Term Time Series Forecasting2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15D3FL: Data Distribution and Detrending for Robust Federated Learning in Non-linear Time-series Data2025-07-15Towards Interpretable Time Series Foundation Models2025-07-10MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models2025-07-09Foundation models for time series forecasting: Application in conformal prediction2025-07-09Bridging the Last Mile of Prediction: Enhancing Time Series Forecasting with Conditional Guided Flow Matching2025-07-09