TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/RMSProp

RMSProp

GeneralIntroduced 2013519 papers

Description

RMSProp is an unpublished adaptive learning rate optimizer proposed by Geoff Hinton. The motivation is that the magnitude of gradients can differ for different weights, and can change during learning, making it hard to choose a single global learning rate. RMSProp tackles this by keeping a moving average of the squared gradient and adjusting the weight updates by this magnitude. The gradient updates are performed as:

E[g2]_t=γE[g2]_t−1+(1−γ)g2_tE\left[g^{2}\right]\_{t} = \gamma E\left[g^{2}\right]\_{t-1} + \left(1 - \gamma\right) g^{2}\_{t}E[g2]_t=γE[g2]_t−1+(1−γ)g2_t

θ_t+1=θ_t−ηE[g2]_t+ϵg_t\theta\_{t+1} = \theta\_{t} - \frac{\eta}{\sqrt{E\left[g^{2}\right]\_{t} + \epsilon}}g\_{t}θ_t+1=θ_t−E[g2]_t+ϵ​η​g_t

Hinton suggests γ=0.9\gamma=0.9γ=0.9, with a good default for η\etaη as 0.0010.0010.001.

Image: Alec Radford

Papers Using This Method

Hindsight-Guided Momentum (HGM) Optimizer: An Approach to Adaptive Learning Rate2025-06-22Deep Learning-Based BMD Estimation from Radiographs with Conformal Uncertainty Quantification2025-05-28Intelligent Incident Hypertension Prediction in Obstructive Sleep Apnea2025-05-27Deep Learning for Breast Cancer Detection: Comparative Analysis of ConvNeXT and EfficientNet2025-05-24SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models2025-05-22Vulnerability of Transfer-Learned Neural Networks to Data Reconstruction Attacks in Small-Data Regime2025-05-20Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data2025-05-15Real-World fNIRS-Based Brain-Computer Interfaces: Benchmarking Deep Learning and Classical Models in Interactive Gaming2025-05-15Trial and Trust: Addressing Byzantine Attacks with Comprehensive Defense Strategy2025-05-12V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models2025-05-08AnomalyMatch: Discovering Rare Objects of Interest with Semi-supervised and Active Learning2025-05-06CSASN: A Multitask Attention-Based Framework for Heterogeneous Thyroid Carcinoma Classification in Ultrasound Images2025-05-04Conformal Prediction for Indoor Positioning with Correctness Coverage Guarantees2025-05-03Sharp higher order convergence rates for the Adam optimizer2025-04-28Some Optimizers are More Equal: Understanding the Role of Optimizers in Group Fairness2025-04-21Covariant Gradient Descent2025-04-07Training Frozen Feature Pyramid DINOv2 for Eyelid Measurements with Infinite Encoding and Orthogonal Regularization2025-04-01Efficient Building Roof Type Classification: A Domain-Specific Self-Supervised Approach2025-03-28World Model Agents with Change-Based Intrinsic Motivation2025-03-26Deep learning-based identification of precipitation clouds from all-sky camera data for observatory safety2025-03-24