Quasi-Hyperbolic Momentum (QHM) is a stochastic optimization technique that alters momentum SGD with a momentum step, averaging an SGD step with a momentum step:
g_t+1=βg_t+(1−β)⋅∇L^_t(θ_t)
θ_t+1=θ_t−α[(1−v)⋅∇L^_t(θ_t)+v⋅g_t+1]
The authors suggest a rule of thumb of v=0.7 and β=0.999.