Papers With Code

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


QHM

General · Introduced 2018 · 2 papers
Source Paper

Description

Quasi-Hyperbolic Momentum (QHM) is a stochastic optimization technique that modifies momentum SGD by replacing its update with a weighted average of a plain SGD step and a momentum step:

$$g_{t+1} = \beta g_{t} + \left(1-\beta\right)\cdot\nabla\hat{L}_{t}\left(\theta_{t}\right)$$

$$\theta_{t+1} = \theta_{t} - \alpha\left[\left(1-v\right)\cdot\nabla\hat{L}_{t}\left(\theta_{t}\right) + v\cdot g_{t+1}\right]$$

The authors suggest a rule of thumb of $v = 0.7$ and $\beta = 0.999$.
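The two update equations above can be sketched in a few lines of NumPy. This is a minimal illustration, not the paper's reference implementation; the function name `qhm_step` and the toy quadratic objective below are made up for the example.

```python
import numpy as np

def qhm_step(theta, g, grad, lr=0.1, beta=0.999, v=0.7):
    """One QHM update following the two equations above.

    theta: parameter vector, g: momentum buffer, grad: gradient of the
    loss at theta. Returns the updated (theta, g). Defaults use the
    paper's rule-of-thumb v = 0.7, beta = 0.999.
    """
    g = beta * g + (1.0 - beta) * grad                # g_{t+1}
    theta = theta - lr * ((1.0 - v) * grad + v * g)   # theta_{t+1}
    return theta, g

# Toy usage: minimize f(theta) = ||theta||^2 / 2, whose gradient is theta.
theta = np.array([1.0, -2.0])
g = np.zeros_like(theta)
for _ in range(2000):
    theta, g = qhm_step(theta, g, grad=theta)
```

Note that setting v = 0 recovers plain SGD, while v = 1 recovers standard (exponential-moving-average) momentum SGD, which is what makes QHM an interpolation between the two.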

Papers Using This Method

- Understanding the Role of Momentum in Stochastic Gradient Methods (2019-10-30)
- Quasi-hyperbolic momentum and Adam for deep learning (2018-10-16)