Learning Neural PDE Solvers with Parameter-Guided Channel Attention

Makoto Takamoto, Francesco Alesiani, Mathias Niepert

2023-04-27Weather Forecasting PDE Surrogate Modeling

Abstract

Scientific Machine Learning (SciML) is concerned with the development of learned emulators of physical systems governed by partial differential equations (PDE). In application domains such as weather forecasting, molecular dynamics, and inverse design, ML-based surrogate models are increasingly used to augment or replace inefficient and often non-differentiable numerical simulation algorithms. While a number of ML-based methods for approximating the solutions of PDEs have been proposed in recent years, they typically do not adapt to the parameters of the PDEs, making it difficult to generalize to PDE parameters not seen during training. We propose a Channel Attention mechanism guided by PDE Parameter Embeddings (CAPE) component for neural surrogate models and a simple yet effective curriculum learning strategy. The CAPE module can be combined with neural PDE solvers allowing them to adapt to unseen PDE parameters. The curriculum learning strategy provides a seamless transition between teacher-forcing and fully auto-regressive training. We compare CAPE in conjunction with the curriculum learning strategy using a popular PDE benchmark and obtain consistent and significant improvements over the baseline models. The experiments also show several advantages of CAPE, such as its increased ability to generalize to unseen PDE parameters without large increases inference time and parameter count.

Related Papers

FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale2025-07-16 D3FL: Data Distribution and Detrending for Robust Federated Learning in Non-linear Time-series Data2025-07-15 Optimising 4th-Order Runge-Kutta Methods: A Dynamic Heuristic Approach for Efficiency and Low Storage2025-06-26 Distributed Cross-Channel Hierarchical Aggregation for Foundation Models2025-06-26 Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting2025-06-24 Skillful joint probabilistic weather forecasting from marginals2025-06-12 A multi-scale loss formulation for learning a probabilistic model with proper score optimisation2025-06-12 AtmosMJ: Revisiting Gating Mechanism for AI Weather Forecasting Beyond the Year Scale2025-06-11