Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/RMSProp

RMSProp

GeneralIntroduced 2013519 papers

Description

RMSProp is an unpublished adaptive learning rate optimizer proposed by Geoff Hinton. The motivation is that the magnitude of gradients can differ for different weights, and can change during learning, making it hard to choose a single global learning rate. RMSProp tackles this by keeping a moving average of the squared gradient and adjusting the weight updates by this magnitude. The gradient updates are performed as:

$E\left[g^{2}\right]\_{t} = \gamma E\left[g^{2}\right]\_{t-1} + \left(1 - \gamma\right) g^{2}\_{t}$

$\theta\_{t+1} = \theta\_{t} - \frac{\eta}{\sqrt{E\left[g^{2}\right]\_{t} + \epsilon}}g\_{t}$

Hinton suggests $\gamma=0.9$ , with a good default for $\eta$ as $0.001$ .

Image: Alec Radford

Papers Using This Method

Hindsight-Guided Momentum (HGM) Optimizer: An Approach to Adaptive Learning Rate2025-06-22 Deep Learning-Based BMD Estimation from Radiographs with Conformal Uncertainty Quantification2025-05-28 Intelligent Incident Hypertension Prediction in Obstructive Sleep Apnea2025-05-27 Deep Learning for Breast Cancer Detection: Comparative Analysis of ConvNeXT and EfficientNet2025-05-24 SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models2025-05-22 Vulnerability of Transfer-Learned Neural Networks to Data Reconstruction Attacks in Small-Data Regime2025-05-20 Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data2025-05-15 Real-World fNIRS-Based Brain-Computer Interfaces: Benchmarking Deep Learning and Classical Models in Interactive Gaming2025-05-15 Trial and Trust: Addressing Byzantine Attacks with Comprehensive Defense Strategy2025-05-12 V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models2025-05-08 AnomalyMatch: Discovering Rare Objects of Interest with Semi-supervised and Active Learning2025-05-06 CSASN: A Multitask Attention-Based Framework for Heterogeneous Thyroid Carcinoma Classification in Ultrasound Images2025-05-04 Conformal Prediction for Indoor Positioning with Correctness Coverage Guarantees2025-05-03 Sharp higher order convergence rates for the Adam optimizer2025-04-28 Some Optimizers are More Equal: Understanding the Role of Optimizers in Group Fairness2025-04-21 Covariant Gradient Descent2025-04-07 Training Frozen Feature Pyramid DINOv2 for Eyelid Measurements with Infinite Encoding and Orthogonal Regularization2025-04-01 Efficient Building Roof Type Classification: A Domain-Specific Self-Supervised Approach2025-03-28 World Model Agents with Change-Based Intrinsic Motivation2025-03-26 Deep learning-based identification of precipitation clouds from all-sky camera data for observatory safety2025-03-24