SLAMB

Sparse Layer-wise Adaptive Moments optimizer for large Batch training

General · Introduced 2000 · 1 paper

Papers Using This Method