Aggregated Momentum (AggMo) is a variant of the classical momentum stochastic optimizer which maintains several velocity vectors with different β parameters. AggMo averages the velocity vectors when updating the parameters. It resolves the problem of choosing a momentum parameter by taking a linear combination of multiple momentum buffers. Each of K momentum buffers have a different discount factor β∈RK, and these are averaged for the update. The update rule is:
v_t(i)=β(i)v_t−1(i)−∇_θf(θ_t−1)
θ_t=θ_t−1+Kγ_t∑K_i=1v_t(i)
where v0(i) for each i. The vector β=[β(1),…,β(K)] is the dampening factor.