TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/LARS

LARS

GeneralIntroduced 200077 papers
Source Paper

Description

Layer-wise Adaptive Rate Scaling, or LARS, is a large batch optimization technique. There are two notable differences between LARS and other adaptive algorithms such as Adam or RMSProp: first, LARS uses a separate learning rate for each layer and not for each weight. And second, the magnitude of the update is controlled with respect to the weight norm for better control of training speed.

m_t=β_1m_t−1+(1−β_1)(g_t+λx_t)m\_{t} = \beta\_{1}m\_{t-1} + \left(1-\beta\_{1}\right)\left(g\_{t} + \lambda{x\_{t}}\right)m_t=β_1m_t−1+(1−β_1)(g_t+λx_t) x_t+1(i)=x_t(i)−η_tϕ(∣∣x_t(i)∣∣)∣∣m_t(i)∣∣m_t(i)x\_{t+1}^{\left(i\right)} = x\_{t}^{\left(i\right)} - \eta\_{t}\frac{\phi\left(|| x\_{t}^{\left(i\right)} ||\right)}{|| m\_{t}^{\left(i\right)} || }m\_{t}^{\left(i\right)} x_t+1(i)=x_t(i)−η_t∣∣m_t(i)∣∣ϕ(∣∣x_t(i)∣∣)​m_t(i)

Papers Using This Method

Analyzing Breast Cancer Survival Disparities by Race and Demographic Location: A Survival Analysis Approach2025-06-08Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare2025-05-22Is Self-Supervised Pre-training on Satellite Imagery Better than ImageNet? A Systematic Study with Sentinel-22025-02-15Self-Supervised Frameworks for Speaker Verification via Bootstrapped Positive Sampling2025-01-29Tackling Small Sample Survival Analysis via Transfer Learning: A Study of Colorectal Cancer Prognosis2025-01-21Prediction of Lung Metastasis from Hepatocellular Carcinoma using the SEER Database2025-01-20Diagnosis and Severity Assessment of Ulcerative Colitis using Self Supervised Learning2024-12-09SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation2024-10-15The Informed Elastic Net for Fast Grouped Variable Selection and FDR Control in Genomics Research2024-10-07Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs2024-06-17Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation2024-05-29Self-Supervised Multiple Instance Learning for Acute Myeloid Leukemia Classification2024-03-08SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning2024-01-24Survival Analysis of Young Triple-Negative Breast Cancer Patients2024-01-15Enhancing Contrastive Learning with Efficient Combinatorial Positive Pairing2024-01-11Quantum Algorithms for the Pathwise Lasso2023-12-21Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving Trend2023-11-22SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA2023-10-10Revisiting LARS for Large Batch Training Generalization of Neural Networks2023-09-25Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)2023-09-24