TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Scalable Gradients for Stochastic Differential Equations

Scalable Gradients for Stochastic Differential Equations

Xuechen Li, Ting-Kam Leonard Wong, Ricky T. Q. Chen, David Duvenaud

2020-01-05Video PredictionVariational Inference
PaperPDFCodeCodeCodeCode(official)

Abstract

The adjoint sensitivity method scalably computes gradients of solutions to ordinary differential equations. We generalize this method to stochastic differential equations, allowing time-efficient and constant-memory computation of gradients with high-order adaptive solvers. Specifically, we derive a stochastic differential equation whose solution is the gradient, a memory-efficient algorithm for caching noise, and conditions under which numerical solutions converge. In addition, we combine our method with gradient-based stochastic variational inference for latent stochastic differential equations. We use our method to fit stochastic dynamics defined by neural networks, achieving competitive performance on a 50-dimensional motion capture dataset.

Results

TaskDatasetMetricValueModel
VideoCMU Mocap-2Test Error4.03Latent SDE
VideoCMU Mocap-2Test Error5.98Latent ODE
Video PredictionCMU Mocap-2Test Error4.03Latent SDE
Video PredictionCMU Mocap-2Test Error5.98Latent ODE

Related Papers

Interpretable Bayesian Tensor Network Kernel Machines with Automatic Rank and Feature Selection2025-07-15Epona: Autoregressive Diffusion World Model for Autonomous Driving2025-06-30Whole-Body Conditioned Egocentric Video Prediction2025-06-26Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference2025-06-26MinD: Unified Visual Imagination and Control via Hierarchical World Models2025-06-23VHU-Net: Variational Hadamard U-Net for Body MRI Bias Field Correction2025-06-23AMPLIFY: Actionless Motion Priors for Robot Learning from Videos2025-06-17Branching Stein Variational Gradient Descent for sampling multimodal distributions2025-06-16