Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods

5,489 machine learning methods and techniques

All · Audio · Computer Vision · General · Graphs · Natural Language Processing · Reinforcement Learning · Sequential

Neural adjoint

Neural adjoint method

The NA method can be divided into two steps: (i) training a neural network approximation of $f$, and (ii) inference of $\hat{x}$. Step (i) is conventional and involves training a generic neural network on a dataset of input/output pairs from the simulator, denoted $D$, resulting in $\hat{f}$, an approximation of the forward model. In step (ii), the goal is to use $\partial \hat{f}/\partial x$ to gradually adjust $x$ so that it achieves a desired output of the forward model, $y$. This is similar to many classical inverse modeling approaches, such as the popular Adjoint method [8, 9]. For many practical inverse problems, however, obtaining $\partial f/\partial x$ requires significant expertise and/or effort, making these approaches challenging. Crucially, $\hat{f}$ from step (i) provides a closed-form differentiable expression for the simulator, from which it is trivial to compute $\partial \hat{f}/\partial x$; furthermore, modern deep learning software packages can efficiently estimate gradients given a loss function $\mathcal{L}$. More formally, let $y$ be the target output and $\hat{x}_i$ the current estimate of the solution, where $i$ indexes each solution obtained in an iterative gradient-based estimation procedure. Then $\hat{x}_{i+1}$ is computed by a gradient step on the loss with respect to the input: $\hat{x}_{i+1} = \hat{x}_i - \alpha\, \partial \mathcal{L}/\partial x\,\big|_{x=\hat{x}_i}$, where $\alpha$ is a step size.
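
As a concrete illustration, here is a minimal PyTorch sketch of step (ii), assuming a small MLP stands in for the trained surrogate $\hat{f}$; the architecture, shapes, and hyperparameters are illustrative, not those of the paper:

```python
import torch

# Step (i) is assumed done: f_hat is a trained surrogate of the simulator.
f_hat = torch.nn.Sequential(
    torch.nn.Linear(4, 64), torch.nn.ReLU(), torch.nn.Linear(64, 2)
)
f_hat.eval()
for p in f_hat.parameters():
    p.requires_grad_(False)          # only the input x is optimized in step (ii)

y_target = torch.tensor([0.3, -1.2])          # desired forward-model output y
x_hat = torch.randn(4, requires_grad=True)    # initial guess for the solution
opt = torch.optim.Adam([x_hat], lr=1e-2)

for _ in range(1000):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(f_hat(x_hat), y_target)
    loss.backward()                  # autograd supplies dL/dx through f_hat
    opt.step()                       # gradient step on x_hat, not on weights
```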

General · Introduced 2000 · 3 papers

PyTorch DDP

PyTorch DDP (Distributed Data Parallel) is a distributed data parallel implementation for PyTorch. To guarantee mathematical equivalence, all replicas start from the same initial values for model parameters and synchronize gradients to keep parameters consistent across training iterations. To minimize intrusiveness, the implementation exposes the same forward API as the user model, allowing applications to seamlessly replace occurrences of a user model with the distributed data parallel model object with no additional code changes. Several techniques are integrated into the design to deliver high-performance training, including bucketing gradients, overlapping communication with computation, and skipping gradient synchronization.
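
A minimal single-node sketch of the DDP workflow, typically launched with `torchrun`, which sets the required environment variables; the toy model and hyperparameters are illustrative:

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="gloo")  # use "nccl" for multi-GPU training

    model = torch.nn.Linear(10, 1)
    ddp_model = DDP(model)  # exposes the same forward API as the wrapped model

    opt = torch.optim.SGD(ddp_model.parameters(), lr=0.1)
    for _ in range(3):
        opt.zero_grad()
        out = ddp_model(torch.randn(8, 10))   # each replica sees its own shard
        loss = out.square().mean()
        loss.backward()  # gradients are bucketed and all-reduced here,
                         # overlapping communication with the backward pass
        opt.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```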

General · Introduced 2000 · 3 papers

reSGLD

Replica exchange stochastic gradient Langevin Dynamics

reSGLD simulates a high-temperature particle for exploration and a low-temperature particle for exploitation, and allows the two to swap. A correction term is included in the swapping rate to avoid the bias introduced by noisy energy estimates.
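
A minimal NumPy sketch on a toy quadratic energy; the constant `correction` is a stand-in for the paper's variance-based bias correction, whose exact form (and the exact swapping rule) is given in the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

def U(theta):        # energy; a noisy estimate in the mini-batch setting
    return 0.5 * float(theta @ theta)

def grad_U(theta):   # its (stochastic) gradient
    return theta

def sgld_step(theta, tau, eta=1e-3):
    # Langevin update: theta - eta * grad(U) + sqrt(2 * eta * tau) * noise
    return theta - eta * grad_U(theta) + np.sqrt(2 * eta * tau) * rng.normal(size=theta.shape)

tau_low, tau_high = 0.01, 1.0   # exploitation / exploration temperatures
correction = 0.1                # stand-in for the bias-correction term
low, high = rng.normal(size=2), rng.normal(size=2)

for step in range(10_000):
    low, high = sgld_step(low, tau_low), sgld_step(high, tau_high)
    # Swap test: the correction compensates for the bias that noisy
    # energy estimates introduce into the acceptance ratio.
    log_s = (1 / tau_low - 1 / tau_high) * (U(low) - U(high) - correction)
    if np.log(rng.uniform()) < min(0.0, log_s):
        low, high = high, low
```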

General · Introduced 2000 · 3 papers

3D SA

3 Dimensional Soft Attention

General · Introduced 2000 · 3 papers

E-swish

General · Introduced 2000 · 2 papers

L2M

Learning to Match

L2M is a learning algorithm that can work for most cross-domain distribution matching tasks. It automatically learns the cross-domain distribution matching without relying on hand-crafted priors on the matching loss. Instead, L2M reduces the inductive bias by using a meta-network to learn the distribution matching loss in a data-driven way.

General · Introduced 2000 · 2 papers

ProxyAnchorLoss

Proxy Anchor Loss for Deep Metric Learning

General · Introduced 2000 · 2 papers

Targeted Dropout

General · Introduced 2000 · 2 papers

AdEMAMix

Adaptive EMA Mixture

General · Introduced 2000 · 2 papers

Adversarial Soft Advantage Fitting (ASAF)

General · Introduced 2000 · 2 papers

DSelect-k

DSelect-k is a continuously differentiable and sparse gate for Mixture-of-Experts (MoE), based on a novel binary encoding formulation. Given a user-specified parameter $k$, the gate selects at most $k$ out of the $n$ experts. The gate can be trained using first-order methods, such as stochastic gradient descent, and offers explicit control over the number of experts to select. This explicit control over sparsity leads to a cardinality-constrained optimization problem, which is computationally challenging. To circumvent this challenge, the authors use an unconstrained reformulation that is equivalent to the original problem. The reformulated problem uses a binary encoding scheme to implicitly enforce the cardinality constraint. By carefully smoothing the binary encoding variables, the reformulated problem can be effectively optimized using first-order methods such as SGD. The motivation for this method is that existing sparse gates, such as Top-k, are not smooth; the lack of smoothness can lead to convergence and statistical performance issues when training with gradient-based methods.
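
The binary-encoding idea can be sketched as follows, assuming a static (input-independent) gate over a power-of-two number of experts; the class and parameter names are hypothetical, and the paper's per-example conditioning and regularizers are omitted:

```python
import torch

def smooth_step(t, gamma=1.0):
    """Cubic smooth-step: 0 for t <= -gamma/2, 1 for t >= gamma/2, smooth between."""
    out = -2 / gamma**3 * t**3 + 3 / (2 * gamma) * t + 0.5
    return torch.where(t <= -gamma / 2, torch.zeros_like(t),
                       torch.where(t >= gamma / 2, torch.ones_like(t), out))

class DSelectKGate(torch.nn.Module):
    # Hypothetical minimal variant; assumes n_experts is a power of two.
    def __init__(self, n_experts, k):
        super().__init__()
        self.m = (n_experts - 1).bit_length()          # bits per expert id
        self.z = torch.nn.Parameter(torch.randn(k, self.m) * 0.1)  # relaxed codes
        self.w = torch.nn.Parameter(torch.zeros(k))    # weights over the k selectors
        codes = [[(i >> j) & 1 for j in range(self.m)] for i in range(n_experts)]
        self.register_buffer("codes", torch.tensor(codes, dtype=torch.float32))

    def forward(self):
        s = smooth_step(self.z)                        # (k, m) relaxed bits in [0, 1]
        # Selector r assigns expert i the product of per-bit match probabilities.
        probs = torch.prod(
            self.codes.unsqueeze(0) * s.unsqueeze(1)
            + (1 - self.codes.unsqueeze(0)) * (1 - s.unsqueeze(1)),
            dim=-1,
        )                                              # (k, n_experts)
        return torch.softmax(self.w, dim=0) @ probs    # sparse in the limit

gate = DSelectKGate(n_experts=8, k=2)
weights = gate()   # differentiable; converges to at most k nonzero entries
```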

General · Introduced 2000 · 2 papers

MFEC

Model-Free Episodic Control

Non-parametric approximation of Q-values by storing all visited states and doing inference through k-Nearest Neighbors.
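
A minimal sketch of the idea, with illustrative names; the full method additionally keeps the maximum return per state, works on learned or random-projection embeddings, and bounds the buffer size:

```python
import numpy as np

class MFEC:
    """Minimal episodic-control sketch: per-action memory of (state, return)."""
    def __init__(self, n_actions, k=5):
        self.k = k
        self.memory = [([], []) for _ in range(n_actions)]  # states, returns

    def q_value(self, state, action):
        states, returns = self.memory[action]
        if not states:
            return float("inf")  # optimistic default encourages trying the action
        d = np.linalg.norm(np.asarray(states) - state, axis=1)
        if d.min() == 0.0:
            return returns[int(d.argmin())]          # exact match: stored value
        nn = d.argsort()[: self.k]
        return float(np.mean([returns[i] for i in nn]))  # kNN average otherwise

    def act(self, state):
        return int(np.argmax([self.q_value(state, a)
                              for a in range(len(self.memory))]))

    def update(self, state, action, episodic_return):
        # Store the visited state with its discounted episodic return.
        self.memory[action][0].append(np.asarray(state))
        self.memory[action][1].append(episodic_return)
```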

General · Introduced 2000 · 2 papers

Channel & Spatial attention

Channel & spatial attention combines the advantages of channel attention and spatial attention. It adaptively selects both important objects and regions.

General · Introduced 2000 · 2 papers

MPSO

Motion-Encoded Particle Swarm Optimization

General · Introduced 2000 · 2 papers

ZLPR Loss

Zero-bounded Log-sum-exp & Pairwise Rank-based Loss

General · Introduced 2000 · 2 papers

Fraternal Dropout

Fraternal Dropout is a regularization method for recurrent neural networks that trains two identical copies of an RNN (with shared parameters) with different dropout masks while minimizing the difference between their (pre-softmax) predictions. This encourages the representations of the RNNs to be invariant to the dropout mask, making them more robust.
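
A minimal sketch of the objective, using a feed-forward network with dropout for brevity (the paper targets RNNs); `kappa` and all shapes are illustrative:

```python
import torch

# Two forward passes of the same network sample different dropout masks;
# the kappa-weighted penalty ties their pre-softmax outputs together.
model = torch.nn.Sequential(
    torch.nn.Linear(32, 64), torch.nn.ReLU(),
    torch.nn.Dropout(p=0.3), torch.nn.Linear(64, 10)
)
criterion = torch.nn.CrossEntropyLoss()
kappa = 0.1  # regularization strength (illustrative value)

x = torch.randn(16, 32)
y = torch.randint(0, 10, (16,))

logits_a = model(x)  # dropout draws a fresh mask on each call
logits_b = model(x)  # same parameters, different mask
loss = (0.5 * (criterion(logits_a, y) + criterion(logits_b, y))
        + kappa * (logits_a - logits_b).pow(2).mean())
loss.backward()
```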

General · Introduced 2000 · 2 papers

Ternary Weight Splitting

Ternary Weight Splitting is the ternarization approach used in BinaryBERT, which exploits the flatness of the ternary loss landscape as an optimization proxy for the binary model. A half-sized ternary BERT is first trained to convergence, and then both the latent full-precision weights and the quantized weights are split into their binary counterparts via the TWS operator. To inherit the performance of the ternary model after splitting, the TWS operator requires splitting equivalency (i.e., the same output given the same input): the split binary weights must reproduce the ternary model's output. While the solution to this constraint is not unique, the latent full-precision weights after splitting are constrained to satisfy it. See the paper for more details.

General · Introduced 2000 · 2 papers

RMN

Residual Masking Network

It uses a segmentation network to refine feature maps, enabling the network to focus on relevant information to make correct decisions.

General · Introduced 2000 · 2 papers

SIFA

Synergistic Image and Feature Alignment

Synergistic Image and Feature Alignment is an unsupervised domain adaptation framework that conducts synergistic alignment of domains from both image and feature perspectives. In SIFA, we simultaneously transform the appearance of images across domains and enhance domain-invariance of the extracted features by leveraging adversarial learning in multiple aspects and with a deeply supervised mechanism. The feature encoder is shared between both adaptive perspectives to leverage their mutual benefits via end-to-end learning.

General · Introduced 2000 · 2 papers

MSGAN

Multi-source Sentiment Generative Adversarial Network

Multi-source Sentiment Generative Adversarial Network is a multi-source domain adaptation (MDA) method for visual sentiment classification. It is composed of three pipelines, i.e., image reconstruction, image translation, and cycle-reconstruction. To handle data from multiple source domains, it learns to find a unified sentiment latent space where data from both the source and target domains share a similar distribution. This is achieved via cycle consistent adversarial learning in an end-to-end manner. Notably, thanks to the unified sentiment latent space, MSGAN requires a single classification network to handle data from different source domains.

General · Introduced 2000 · 2 papers

AHAF

Adaptive Hybrid Activation Function

Trainable activation function as a sigmoid-based generalization of ReLU, Swish and SiLU.

General · Introduced 2000 · 2 papers

Auto-Classifier

General · Introduced 2000 · 2 papers

IMGEP

Intrinsically Motivated Goal Exploration Processes

Population-based intrinsically motivated goal exploration algorithms applied to real world robot learning of complex skills like tool use.

General · Introduced 2000 · 2 papers

AutoML-Zero

AutoML-Zero is an AutoML technique that aims to search a fine-grained space simultaneously for the model, optimization procedure, initialization, and so on, permitting much less human design and even allowing the discovery of non-neural-network algorithms. It represents ML algorithms as computer programs comprised of three component functions, Setup, Predict, and Learn, that perform initialization, prediction, and learning. The instructions in these functions apply basic mathematical operations on a small memory. The operation and memory addresses used by each instruction are free parameters in the search space, as is the size of the component functions. While this reduces expert design, the consequent sparsity means that random search cannot make enough progress. To overcome this difficulty, the authors use small proxy tasks and migration techniques to build an optimized infrastructure capable of searching through 10,000 models/second/CPU core. Evolutionary methods can find solutions in the AutoML-Zero search space despite its enormous size and sparsity. The authors show that by randomly modifying the programs and periodically selecting the best-performing ones on given tasks/datasets, AutoML-Zero discovers reasonable algorithms. Starting from empty programs and using data labeled by “teacher” neural networks with random weights, they demonstrate that evolution can discover neural networks trained by gradient descent. Following this, they minimize bias toward known algorithms by switching to binary classification tasks extracted from CIFAR-10 and allowing a larger set of possible operations. This discovers interesting techniques like multiplicative interactions, normalized gradients, and weight averaging. Finally, they show it is possible for evolution to adapt the algorithm to the type of task provided: for example, dropout-like operations emerge when the task needs regularization, and learning-rate decay appears when the task requires faster convergence.

General · Introduced 2000 · 2 papers

Cosine Normalization

Multi-layer neural networks traditionally use dot products between the output vector of the previous layer and the incoming weight vector as the input to the activation function. The result of a dot product is unbounded. To bound the dot product and decrease the variance, Cosine Normalization uses cosine similarity or centered cosine similarity (the Pearson correlation coefficient) instead of dot products in neural networks. Using cosine normalization, the output of a hidden unit is computed by $o = f(net_{norm}) = f\left(\frac{\vec{w} \cdot \vec{x}}{\left|\vec{w}\right| \left|\vec{x}\right|}\right)$, where $net_{norm}$ is the normalized pre-activation, $\vec{w}$ is the incoming weight vector, $\vec{x}$ is the input vector, $(\cdot)$ indicates the dot product, and $f$ is a nonlinear activation function. Cosine normalization bounds the pre-activation between -1 and 1.
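
A minimal PyTorch sketch of a cosine-normalized linear layer; the class name and epsilon are illustrative:

```python
import torch

class CosineLinear(torch.nn.Module):
    """Linear layer using cosine similarity instead of a raw dot product."""
    def __init__(self, in_features, out_features, eps=1e-8):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.randn(out_features, in_features))
        self.eps = eps

    def forward(self, x):
        w = self.weight / (self.weight.norm(dim=1, keepdim=True) + self.eps)
        x = x / (x.norm(dim=1, keepdim=True) + self.eps)
        return x @ w.t()   # pre-activations bounded in [-1, 1]

layer = CosineLinear(128, 10)
out = layer(torch.randn(4, 128))  # every entry lies in [-1, 1]
```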

General · Introduced 2000 · 2 papers

AdaFisher

Adaptive Second Order Optimization via Fisher Information

AdaFisher – an adaptive second-order optimizer that leverages a block-diagonal approximation to the Fisher information matrix for adaptive gradient preconditioning.

General · Introduced 2000 · 2 papers

Concurrent Spatial and Channel Squeeze & Excitation

Concurrent Spatial and Channel Squeeze & Excitation (scSE)

Combines the channel attention of the widely known spatial squeeze and channel excitation (SE) block and the spatial attention of the channel squeeze and spatial excitation (sSE) block to build a spatial and channel attention mechanism for image segmentation tasks.
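
A minimal PyTorch sketch of the block, merging the two recalibrated maps by addition (one of the aggregation strategies considered in the paper); the class name and reduction ratio are illustrative:

```python
import torch

class SCSE(torch.nn.Module):
    """Concurrent spatial (sSE) and channel (cSE) squeeze & excitation."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.cse = torch.nn.Sequential(           # channel excitation
            torch.nn.AdaptiveAvgPool2d(1),
            torch.nn.Conv2d(channels, channels // reduction, 1),
            torch.nn.ReLU(inplace=True),
            torch.nn.Conv2d(channels // reduction, channels, 1),
            torch.nn.Sigmoid(),
        )
        self.sse = torch.nn.Sequential(           # spatial excitation
            torch.nn.Conv2d(channels, 1, 1),
            torch.nn.Sigmoid(),
        )

    def forward(self, x):
        return x * self.cse(x) + x * self.sse(x)  # additive aggregation

block = SCSE(64)
y = block(torch.randn(2, 64, 32, 32))  # same shape as the input
```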

General · Introduced 2000 · 2 papers

SCA-CNN

Spatial and Channel-wise Attention-based Convolutional Neural Network

As CNN features are naturally spatial, channel-wise, and multi-layer, Chen et al. proposed a novel spatial and channel-wise attention-based convolutional neural network (SCA-CNN). It was designed for the task of image captioning and uses an encoder-decoder framework, where a CNN first encodes an input image into a vector and then an LSTM decodes the vector into a sequence of words. Given an input feature map $X$ and the previous time-step LSTM hidden state $h_{t-1}$, a spatial attention mechanism pays more attention to the semantically useful regions, guided by the LSTM hidden state $h_{t-1}$. The spatial attention model is: \begin{align} a(h_{t-1}, X) &= \tanh\big(\mathrm{Conv}_1^{1 \times 1}(X) \oplus W_1 h_{t-1}\big) \end{align} \begin{align} \Phi_s(h_{t-1}, X) &= \mathrm{Softmax}\big(\mathrm{Conv}_2^{1 \times 1}(a(h_{t-1}, X))\big) \end{align} where $\oplus$ represents the addition of a matrix and a vector. Similarly, channel-wise attention aggregates global information first, and then computes a channel-wise attention weight vector using the hidden state $h_{t-1}$: \begin{align} b(h_{t-1}, X) &= \tanh\big((W_2\,\mathrm{GAP}(X) + b_2) \oplus W_1 h_{t-1}\big) \end{align} \begin{align} \Phi_c(h_{t-1}, X) &= \mathrm{Softmax}\big(W_3\, b(h_{t-1}, X) + b_3\big) \end{align} Overall, the SCA mechanism can be written in one of two ways. If channel-wise attention is applied before spatial attention, we have \begin{align} Y &= f\big(X, \Phi_s(h_{t-1}, X\,\Phi_c(h_{t-1}, X)), \Phi_c(h_{t-1}, X)\big) \end{align} and if spatial attention comes first: \begin{align} Y &= f\big(X, \Phi_s(h_{t-1}, X), \Phi_c(h_{t-1}, X\,\Phi_s(h_{t-1}, X))\big) \end{align} where $f(\cdot)$ denotes the modulate function, which takes the feature map $X$ and the attention maps as input and outputs the modulated feature map $Y$. Unlike previous attention mechanisms, which consider each image region equally and use global spatial information to tell the network where to focus, SCA-CNN leverages the semantic vector to produce the spatial attention map as well as the channel-wise attention weight vector. Beyond being a powerful attention model, SCA-CNN also provides a better understanding of where and what the model focuses on during sentence generation.

General · Introduced 2000 · 2 papers

PAU

Padé Activation Units

Parametrized learnable activation function, based on the Padé approximant.
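
A minimal sketch of the "safe" PAU variant, whose absolute-value denominator avoids poles; the degrees and initialization are illustrative:

```python
import torch

class PAU(torch.nn.Module):
    """Padé Activation Unit sketch: a learnable rational function
    P(x) / (1 + |Q(x)|) with numerator degree m and denominator degree n."""
    def __init__(self, m=5, n=4):
        super().__init__()
        self.a = torch.nn.Parameter(torch.randn(m + 1) * 0.1)  # numerator coeffs
        self.b = torch.nn.Parameter(torch.randn(n) * 0.1)      # denominator coeffs

    def forward(self, x):
        num = sum(a_j * x**j for j, a_j in enumerate(self.a))
        den = 1 + torch.abs(sum(b_k * x**(k + 1) for k, b_k in enumerate(self.b)))
        return num / den   # pole-free because the denominator is >= 1

act = PAU()
y = act(torch.randn(8))  # drop-in replacement for a fixed activation
```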

General · Introduced 2000 · 2 papers

EMEA

Entropy Minimized Ensemble of Adapters

Entropy Minimized Ensemble of Adapters, or EMEA, is a method that optimizes the ensemble weights of pretrained language adapters for each test sentence by minimizing the entropy of its predictions. The intuition behind the method is that good adapter weights for a test input $x$ should make the model more confident in its prediction for $x$; that is, they should lead to lower model entropy on that input.
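
A minimal sketch of the test-time weight optimization; for brevity it ensembles the final logits of a single input, whereas the method ensembles adapter outputs inside the model, and all names and hyperparameters are illustrative:

```python
import torch

def emea_weights(adapter_logits, steps=10, lr=1.0):
    """Optimize ensemble weights over per-adapter logits by minimizing
    the entropy of the mixed prediction for one test input.
    adapter_logits: tensor of shape (n_adapters, n_classes)."""
    alpha = torch.zeros(adapter_logits.size(0), requires_grad=True)
    opt = torch.optim.SGD([alpha], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        mixed = torch.softmax(alpha, 0) @ adapter_logits   # weighted ensemble
        p = torch.softmax(mixed, dim=-1)
        entropy = -(p * torch.log(p + 1e-12)).sum()
        entropy.backward()   # lower entropy = more confident prediction
        opt.step()
    return torch.softmax(alpha.detach(), 0)

w = emea_weights(torch.randn(3, 5))  # weights over 3 language adapters
```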

General · Introduced 2000 · 2 papers

Global Sub-Sampled Attention

Global Sub-Sampled Attention, or GSA, is an attention mechanism used in the Twins-SVT architecture. A single representative is used to summarize the key information for each of the $\frac{H}{k_1} \times \frac{W}{k_2}$ sub-windows, and the representative is used to communicate with other sub-windows (serving as the key in self-attention), which reduces the cost to $\mathcal{O}\!\left(\frac{H^2 W^2 d}{k_1 k_2}\right)$. This is essentially equivalent to using the sub-sampled feature maps as the key in attention operations, and thus it is termed global sub-sampled attention (GSA). If LSA and GSA are used alternately, like separable convolutions (depth-wise + point-wise), the total computation cost is $\mathcal{O}\!\left(HWd\left(k_1 k_2 + \frac{HW}{k_1 k_2}\right)\right)$, whose minimum is obtained when $k_1 k_2 = \sqrt{HW}$. Note that $H = W = 224$ is popular in classification. Without loss of generality, square sub-windows are used, i.e., $k_1 = k_2$, so $k_1 = k_2 = 15$ is close to the global minimum for $H = W = 224$. However, the network is designed to include several stages with variable resolutions; stage 1 has feature maps of $56 \times 56$, for which the minimum is obtained at $k_1 = k_2 \approx 7.5$. Theoretically, the optimal $k_1$ and $k_2$ could be calibrated for each stage; for simplicity, $k_1 = k_2 = 7$ is used everywhere. For stages with lower resolutions, the summarizing window size of GSA is controlled to avoid generating too few keys: specifically, sizes of 4, 2, and 1 are used for the last three stages respectively.

General · Introduced 2000 · 2 papers

SSTDA

Self-Supervised Temporal Domain Adaptation

Self-Supervised Temporal Domain Adaptation (SSTDA) is a method for action segmentation with self-supervised temporal domain adaptation. It contains two self-supervised auxiliary tasks (binary and sequential domain prediction) to jointly align cross-domain feature spaces embedded with local and global temporal dynamics.

General · Introduced 2000 · 2 papers

Recurrent Entity Network

The Recurrent Entity Network is equipped with a dynamic long-term memory which allows it to maintain and update a representation of the state of the world as it receives new data. For language understanding tasks, it can reason on-the-fly as it reads text, not just when it is required to answer a question or respond, as is the case for a Memory Network. Like a Neural Turing Machine or Differentiable Neural Computer, it maintains a fixed-size memory and can learn to perform location- and content-based read and write operations. However, unlike those models it has a simple parallel architecture in which several memory locations can be updated simultaneously. The model consists of a fixed number of dynamic memory cells, each containing a vector key $w_j$ and a vector value (or content) $h_j$. Each cell is associated with its own processor, a simple gated recurrent network that may update the cell value given an input. If each cell learns to represent a concept or entity in the world, one can imagine a gating mechanism that, based on the key and content of the memory cells, will only modify the cells that concern the entities mentioned in the input. There is no direct interaction between the memory cells, hence the system can be seen as multiple identical processors functioning in parallel, with distributed local memory. The sharing of these parameters reflects an invariance of these laws across object instances, similarly to how the weight-tying scheme in a CNN reflects an invariance of image statistics across locations. A cell's hidden state is updated only when new information relevant to its concept is received, and remains otherwise unchanged. The keys used in the addressing/gating mechanism also correspond to concepts or entities, but are modified only during learning, not during inference.

General · Introduced 2000 · 2 papers

Mesh-TensorFlow

Mesh-TensorFlow is a language for specifying a general class of distributed tensor computations. Where data-parallelism can be viewed as splitting tensors and operations along the "batch" dimension, in Mesh-TensorFlow, the user can specify any tensor dimensions to be split across any dimensions of a multi-dimensional mesh of processors. A Mesh-TensorFlow graph compiles into a SPMD program consisting of parallel operations coupled with collective communication primitives such as Allreduce.

General · Introduced 2000 · 2 papers

SAFRAN

SAFRAN - Scalable and fast non-redundant rule application

SAFRAN is a rule application framework which aggregates rules through a scalable clustering algorithm.

General · Introduced 2000 · 2 papers

Spatially Separable Self-Attention

Spatially Separable Self-Attention, or SSSA, is an attention module used in the Twins-SVT architecture that aims to reduce the computational complexity of vision transformers for dense prediction tasks (given high-resolution inputs). SSSA is composed of locally-grouped self-attention (LSA) and global sub-sampled attention (GSA). Formally, spatially separable self-attention can be written as: \begin{align} \hat{z}_{ij}^{l} &= \mathrm{LSA}\big(\mathrm{LayerNorm}(z_{ij}^{l-1})\big) + z_{ij}^{l-1} \end{align} \begin{align} z_{ij}^{l} &= \mathrm{FFN}\big(\mathrm{LayerNorm}(\hat{z}_{ij}^{l})\big) + \hat{z}_{ij}^{l} \end{align} \begin{align} \hat{z}^{l+1} &= \mathrm{GSA}\big(\mathrm{LayerNorm}(z^{l})\big) + z^{l} \end{align} \begin{align} z^{l+1} &= \mathrm{FFN}\big(\mathrm{LayerNorm}(\hat{z}^{l+1})\big) + \hat{z}^{l+1} \end{align} where LSA means locally-grouped self-attention within a sub-window ($\hat{z}_{ij}^{l}$ denoting the sub-window at position $(i, j)$ in layer $l$), and GSA is the global sub-sampled attention that interacts with the representative keys (generated by the sub-sampling function) from each sub-window. Both LSA and GSA have multiple heads, as in standard self-attention.

General · Introduced 2000 · 2 papers

AdaptiveBins

Adaptive Bins

General · Introduced 2000 · 2 papers

Crossbow

Crossbow is a single-server multi-GPU system for training deep learning models that enables users to freely choose their preferred batch size—however small—while scaling to multiple GPUs. Crossbow uses many parallel model replicas and avoids reduced statistical efficiency through a new synchronous training method. SMA, a synchronous variant of model averaging, is used in which replicas independently explore the solution space with gradient descent, but adjust their search synchronously based on the trajectory of a globally-consistent average model.

General · Introduced 2000 · 2 papers

CELU

Continuously Differentiable Exponential Linear Units

Exponential Linear Units (ELUs) are a useful rectifier for constructing deep learning architectures, as they may speed up and otherwise improve learning by virtue of not having vanishing gradients and by having mean activations near zero. However, the ELU activation as parametrized in [1] is not continuously differentiable with respect to its input when the shape parameter alpha is not equal to 1. CELU is an alternative parametrization which is C1 continuous for all values of alpha, making the rectifier easier to reason about and making alpha easier to tune. This alternative parametrization has several other useful properties that the original parametrization of ELU does not: 1) its derivative with respect to x is bounded, 2) it contains both the linear transfer function and ReLU as special cases, and 3) it is scale-similar with respect to alpha.
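
A minimal sketch of the activation itself, checked against PyTorch's built-in implementation:

```python
import torch

def celu(x, alpha=1.0):
    """CELU(x) = max(0, x) + min(0, alpha * (exp(x / alpha) - 1));
    continuously differentiable for every alpha > 0."""
    return (torch.clamp(x, min=0)
            + torch.clamp(alpha * (torch.exp(x / alpha) - 1), max=0))

x = torch.linspace(-3, 3, 7)
assert torch.allclose(celu(x, alpha=0.5),
                      torch.nn.functional.celu(x, alpha=0.5))
```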

General · Introduced 2000 · 2 papers

End-To-End Memory Network

An End-to-End Memory Network is a neural network with a recurrent attention model over a possibly large external memory. The architecture is a form of Memory Network, but unlike the model in that work, it is trained end-to-end, and hence requires significantly less supervision during training. It can also be seen as an extension of RNNsearch to the case where multiple computational steps (hops) are performed per output symbol. The model takes a discrete set of inputs $x_1, \dots, x_n$ that are to be stored in the memory, a query $q$, and outputs an answer $a$. Each of the $x_i$, $q$, and $a$ contains symbols coming from a dictionary with $V$ words. The model writes all $x$ to the memory up to a fixed buffer size, and then finds a continuous representation for the $x$ and $q$. The continuous representation is then processed via multiple hops to output $a$.
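
A single-hop sketch of the memory read, with illustrative names, shapes, and bag-of-words embeddings:

```python
import torch

# Sentences x_i and query q are embedded; match scores give attention
# weights, and the weighted sum of output embeddings is added to the query.
V, d, n = 100, 20, 8                 # vocab size, embedding dim, memory size
A = torch.nn.Embedding(V, d)         # input (key) memory embedding
C = torch.nn.Embedding(V, d)         # output (value) memory embedding
B = torch.nn.Embedding(V, d)         # query embedding
W = torch.nn.Linear(d, V)            # final answer projection

x = torch.randint(0, V, (n, 6))      # n memory sentences of 6 tokens each
q = torch.randint(0, V, (1, 6))      # the query

m = A(x).sum(dim=1)                  # (n, d) memory keys (bag-of-words)
c = C(x).sum(dim=1)                  # (n, d) memory values
u = B(q).sum(dim=1)                  # (1, d) internal query state
p = torch.softmax(u @ m.t(), dim=-1) # (1, n) attention over memories
o = p @ c                            # (1, d) response vector
a = torch.softmax(W(o + u), dim=-1)  # predicted answer distribution
# Multiple hops stack this read, feeding u_{k+1} = u_k + o_k back in.
```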

General · Introduced 2000 · 2 papers

FRbE

Fuzzy Rank-based Ensemble

The motivation for this ensembling approach is to fully utilize the confidence factors generated by the base learners by mapping them onto non-linear functions. One of the mapped values signifies closeness to 1, and the other signifies deviation from 1, which overcomes a shortcoming of conventional ranking methods. The scores from the base learners are mapped onto two functions with different concavities to generate non-linear fuzzy ranks, and the two ranks are combined into a fused score that quantifies the total deviation from the expected output. The lower the deviation, the higher the confidence towards a particular class; the class with the lowest deviation value is assigned as the final class. Pre-trained CNN models serve as the base learners.

General · Introduced 2000 · 2 papers

Class Activation Guided Attention Mechanism

Class Activation Guided Attention Mechanism (CAGAM)

CAGAM is a form of spatial attention mechanism that propagates attention from known context features to unknown context features, thereby enhancing the unknown context for relevant pattern discovery. Usually the known context feature is a class activation map (CAM).

General · Introduced 2000 · 2 papers

SkipInit

SkipInit is a method that aims to allow normalization-free training of neural networks by downscaling residual branches at initialization. This is achieved by including a learnable scalar multiplier at the end of each residual branch, initialized to zero. The method is motivated by theoretical findings that batch normalization downscales the hidden activations on the residual branch by a factor on the order of the square root of the network depth (at initialization). Therefore, as the depth of a residual network is increased, the residual blocks are increasingly dominated by the skip connection, which drives the functions computed by residual blocks closer to the identity, preserving signal propagation and ensuring well-behaved gradients. This leads to the proposed method, which achieves this property through an initialization strategy rather than a normalization strategy.
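
A minimal sketch of a normalization-free residual block with the SkipInit scalar; the branch architecture is illustrative:

```python
import torch

class SkipInitBlock(torch.nn.Module):
    """Residual block without normalization: the residual branch is scaled
    by a learnable scalar initialized to zero, so at initialization the
    block is exactly the identity."""
    def __init__(self, channels):
        super().__init__()
        self.branch = torch.nn.Sequential(
            torch.nn.Conv2d(channels, channels, 3, padding=1),
            torch.nn.ReLU(inplace=True),
            torch.nn.Conv2d(channels, channels, 3, padding=1),
        )
        self.alpha = torch.nn.Parameter(torch.zeros(1))  # the SkipInit scalar

    def forward(self, x):
        return x + self.alpha * self.branch(x)

block = SkipInitBlock(16)
x = torch.randn(2, 16, 8, 8)
assert torch.allclose(block(x), x)  # identity at initialization
```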

General · Introduced 2000 · 2 papers

LFME

Learning From Multiple Experts

Learning From Multiple Experts is a self-paced knowledge distillation framework that aggregates the knowledge from multiple 'Experts' to learn a unified student model. Specifically, the proposed framework involves two levels of adaptive learning schedules: Self-paced Expert Selection and Curriculum Instance Selection, so that the knowledge is adaptively transferred to the 'Student'. The self-paced expert selection automatically controls the impact of knowledge distillation from each expert, so that the learned student model will gradually acquire the knowledge from the experts, and finally exceed the expert. The curriculum instance selection, on the other hand, designs a curriculum for the unified model where the training samples are organized from easy to hard, so that the unified student model will receive a less challenging learning schedule, and gradually learns from easy to hard samples.

General · Introduced 2000 · 2 papers

CV-MIM

Contrastive Cross-View Mutual Information Maximization

CV-MIM, or Contrastive Cross-View Mutual Information Maximization, is a representation learning method to disentangle pose-dependent as well as view-dependent factors from 2D human poses. The method trains a network using cross-view mutual information maximization, which maximizes mutual information of the same pose performed from different viewpoints in a contrastive learning manner. It further utilizes two regularization terms to ensure disentanglement and smoothness of the learned representations.

General · Introduced 2000 · 2 papers

CCAC

Confidence Calibration with an Auxiliary Class

Confidence Calibration with an Auxiliary Class, or CCAC, is a post-hoc confidence calibration method for DNN classifiers on OOD datasets. The key feature of CCAC is an auxiliary class in the calibration model that separates mis-classified samples from correctly classified ones, thus effectively mitigating the target DNN being confidently wrong. The number of free parameters in CCAC can also be reduced to facilitate transfer to a new unseen dataset.

General · Introduced 2000 · 2 papers
Page 10 of 110