TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Non-convex Learning via Replica Exchange Stochastic Gradie...

Non-convex Learning via Replica Exchange Stochastic Gradient MCMC

Wei Deng, Qi Feng, Liyao Gao, Faming Liang, Guang Lin

2020-08-12ICML 2020 1Image Classification
PaperPDFCodeCode(official)

Abstract

Replica exchange Monte Carlo (reMC), also known as parallel tempering, is an important technique for accelerating the convergence of the conventional Markov Chain Monte Carlo (MCMC) algorithms. However, such a method requires the evaluation of the energy function based on the full dataset and is not scalable to big data. The na\"ive implementation of reMC in mini-batch settings introduces large biases, which cannot be directly extended to the stochastic gradient MCMC (SGMCMC), the standard sampling method for simulating from deep neural networks (DNNs). In this paper, we propose an adaptive replica exchange SGMCMC (reSGMCMC) to automatically correct the bias and study the corresponding properties. The analysis implies an acceleration-accuracy trade-off in the numerical discretization of a Markov jump process in a stochastic environment. Empirically, we test the algorithm through extensive experiments on various setups and obtain the state-of-the-art results on CIFAR10, CIFAR100, and SVHN in both supervised learning and semi-supervised learning tasks.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10Percentage correct97.42WRN-28-10 with reSGHMC
Image ClassificationCIFAR-10Percentage correct96.87WRN-16-8 with reSGHMC
Image ClassificationCIFAR-10Percentage correct96.12ResNet56 with reSGHMC
Image ClassificationCIFAR-10Percentage correct95.35ResNet32 with reSGHMC
Image ClassificationCIFAR-10Percentage correct94.62ResNet20 with reSGHMC
Image ClassificationCIFAR-100Percentage correct84.38WRN-28-10 with reSGHMC
Image ClassificationCIFAR-100Percentage correct82.95WRN-16-8 with reSGHMC
Image ClassificationCIFAR-100Percentage correct80.14ResNet56 with reSGHMC
Image ClassificationCIFAR-100Percentage correct76.55ResNet32 with reSGHMC
Image ClassificationCIFAR-100Percentage correct74.14ResNet20 with reSGHMC

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise2025-07-13