AdaHessian

General | Introduced 2000 | 6 papers

Description

ADAHESSIAN is a stochastic optimization algorithm that directly incorporates approximate curvature information from the loss function. It introduces several performance-improving features, among them a fast Hutchinson-based method that approximates the curvature matrix with low computational overhead.
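To make the Hutchinson idea concrete, the following is a minimal sketch of the underlying estimator, not the authors' implementation. It estimates the diagonal of the Hessian as the average of z ⊙ (Hz) over random Rademacher vectors z (entries ±1). The names `hessian_vector_product`, `hutchinson_diagonal`, `toy_grad`, and the matrix `A` are hypothetical and exist only for this illustration. As a simplifying assumption, the Hessian-vector product is approximated here by finite differences of the gradient so that the example needs only the standard library; in practice it would more likely come from automatic differentiation.

```python
import random


def hessian_vector_product(grad_fn, w, v, eps=1e-5):
    """Approximate H(w) @ v with a central finite difference of the gradient.

    Assumption for this sketch: a finite difference stands in for the
    automatic-differentiation Hessian-vector product used in practice.
    """
    w_plus = [wi + eps * vi for wi, vi in zip(w, v)]
    w_minus = [wi - eps * vi for wi, vi in zip(w, v)]
    g_plus = grad_fn(w_plus)
    g_minus = grad_fn(w_minus)
    return [(gp - gm) / (2.0 * eps) for gp, gm in zip(g_plus, g_minus)]


def hutchinson_diagonal(grad_fn, w, num_samples=1, rng=None):
    """Estimate diag(H) as the sample mean of z * (H z), z Rademacher."""
    rng = rng or random.Random(0)
    n = len(w)
    diag = [0.0] * n
    for _ in range(num_samples):
        # Rademacher vector: each entry is +1 or -1 with equal probability.
        z = [rng.choice((-1.0, 1.0)) for _ in range(n)]
        hz = hessian_vector_product(grad_fn, w, z)
        for i in range(n):
            diag[i] += z[i] * hz[i] / num_samples
    return diag


# Hypothetical toy loss f(w) = 0.5 * w^T A w with symmetric A,
# so the gradient is A w and the exact Hessian is A itself.
A = [
    [2.0, 0.5, 0.0],
    [0.5, 3.0, 0.1],
    [0.0, 0.1, 1.0],
]


def toy_grad(w):
    return [sum(A[i][j] * w[j] for j in range(len(w))) for i in range(len(A))]


estimate = hutchinson_diagonal(toy_grad, [0.3, -1.2, 0.7], num_samples=2000)
print(estimate)  # should be close to the true diagonal [2.0, 3.0, 1.0]
```

Each sample costs one Hessian-vector product, which is why the overhead stays low compared with forming the full Hessian. The many samples used here only serve to make the toy estimate visibly converge; a single sample is unbiased but noisy.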
