TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Improving Generalization in Federated Learning by Seeking ...

Improving Generalization in Federated Learning by Seeking Flat Minima

Debora Caldarola, Barbara Caputo, Marco Ciccone

2022-03-22Image ClassificationDomain GeneralizationFederated LearningSemantic Segmentation
PaperPDFCode(official)

Abstract

Models trained in federated settings often suffer from degraded performances and fail at generalizing, especially when facing heterogeneous scenarios. In this work, we investigate such behavior through the lens of geometry of the loss and Hessian eigenspectrum, linking the model's lack of generalization capacity to the sharpness of the solution. Motivated by prior studies connecting the sharpness of the loss surface and the generalization gap, we show that i) training clients locally with Sharpness-Aware Minimization (SAM) or its adaptive version (ASAM) and ii) averaging stochastic weights (SWA) on the server-side can substantially improve generalization in Federated Learning and help bridging the gap with centralized models. By seeking parameters in neighborhoods having uniform low loss, the model converges towards flatter minima and its generalization significantly improves in both homogeneous and heterogeneous scenarios. Empirical results demonstrate the effectiveness of those optimizers across a variety of benchmark vision datasets (e.g. CIFAR10/100, Landmarks-User-160k, IDDA) and tasks (large scale classification, semantic segmentation, domain generalization).

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-100 (alpha=0, 20 clients per round)ACC@1-100Clients51.58FedAvgM + ASAM + SWA
Federated LearningCIFAR-100 (alpha=0, 5 clients per round)ACC@1-100Clients42.01FedASAM + SWA
Federated LearningCIFAR-100 (alpha=0, 5 clients per round)ACC@1-100Clients39.3FedSAM + SWA
Federated LearningCIFAR-100 (alpha=0, 5 clients per round)ACC@1-100Clients36.04FedASAM
Federated LearningCIFAR-100 (alpha=0, 5 clients per round)ACC@1-100Clients31.04FedSAM
Federated LearningCIFAR-100 (alpha=0, 5 clients per round)ACC@1-100Clients30.25FedAvg
Federated LearningCIFAR-100 (alpha=0, 10 clients per round)ACC@1-100Clients42.64FedASAM + SWA
Federated LearningCIFAR-100 (alpha=0, 10 clients per round)ACC@1-100Clients39.76FedASAM
Federated LearningCIFAR-100 (alpha=0, 10 clients per round)ACC@1-100Clients39.51FedSAM + SWA
Federated LearningCIFAR-100 (alpha=0, 10 clients per round)ACC@1-100Clients36.93FedSAM
Federated LearningCIFAR-100 (alpha=0, 10 clients per round)ACC@1-100Clients36.74FedAvg
Federated LearningCIFAR-100 (alpha=0.5, 5 clients per round)ACC@1-100Clients49.17FedASAM + SWA
Federated LearningCIFAR-100 (alpha=0.5, 5 clients per round)ACC@1-100Clients47.96FedSAM + SWA
Federated LearningCIFAR-100 (alpha=0.5, 5 clients per round)ACC@1-100Clients45.61FedASAM
Federated LearningCIFAR-100 (alpha=0.5, 5 clients per round)ACC@1-100Clients44.73FedSAM
Federated LearningCIFAR-100 (alpha=0.5, 5 clients per round)ACC@1-100Clients40.43FedAvg
Federated LearningCIFAR-100 (alpha=0, 20 clients per round)ACC@1-100Clients41.62FedASAM + SWA
Federated LearningCIFAR-100 (alpha=0, 20 clients per round)ACC@1-100Clients40.81FedASAM
Federated LearningCIFAR-100 (alpha=0, 20 clients per round)ACC@1-100Clients39.24FedSAM + SWA
Federated LearningCIFAR-100 (alpha=0, 20 clients per round)ACC@1-100Clients38.59FedAvg
Federated LearningCIFAR-100 (alpha=0, 20 clients per round)ACC@1-100Clients38.56FedSAM
Federated LearningCIFAR-100 (alpha=0.5, 20 clients per round)ACC@1-100Clients48.27FedASAM + SWA
Federated LearningCIFAR-100 (alpha=0.5, 20 clients per round)ACC@1-100Clients47.78FedASAM
Federated LearningCIFAR-100 (alpha=0.5, 20 clients per round)ACC@1-100Clients46.47FedSAM + SWA
Federated LearningCIFAR-100 (alpha=0.5, 20 clients per round)ACC@1-100Clients46.05FedSAM
Federated LearningCIFAR-100 (alpha=0.5, 20 clients per round)ACC@1-100Clients42.17FedAvg
Federated LearningCIFAR-100 (alpha=1000, 5 clients per round)ACC@1-100Clients54.81FedASAM
Federated LearningCIFAR-100 (alpha=1000, 5 clients per round)ACC@1-100Clients54.01FedSAM
Federated LearningCIFAR-100 (alpha=1000, 5 clients per round)ACC@1-100Clients53.9FedSAM + SWA
Federated LearningCIFAR-100 (alpha=1000, 5 clients per round)ACC@1-100Clients53.86FedASAM + SWA
Federated LearningCIFAR-100 (alpha=1000, 5 clients per round)ACC@1-100Clients49.92FedAvg
Federated LearningCIFAR-100 (alpha=0.5, 10 clients per round)ACC@1-100Clients48.72FedASAM + SWA
Federated LearningCIFAR-100 (alpha=0.5, 10 clients per round)ACC@1-100Clients46.76FedSAM + SWA
Federated LearningCIFAR-100 (alpha=0.5, 10 clients per round)ACC@1-100Clients46.58FedASAM
Federated LearningCIFAR-100 (alpha=0.5, 10 clients per round)ACC@1-100Clients44.84FedSAM
Federated LearningCIFAR-100 (alpha=0.5, 10 clients per round)ACC@1-100Clients41.27FedAvg
Federated LearningCityscapes heterogeneousmIoU49.75SiloBN + ASAM
Federated LearningCityscapes heterogeneousmIoU49.1SiloBN + SAM
Federated LearningCityscapes heterogeneousmIoU45.96SiloBN
Federated LearningCityscapes heterogeneousmIoU43.42FedSAM + SWA
Federated LearningCityscapes heterogeneousmIoU43.02FedASAM + SWA
Federated LearningCityscapes heterogeneousmIoU42.48FedAvg + SWA
Federated LearningCityscapes heterogeneousmIoU42.27FedASAM
Federated LearningCityscapes heterogeneousmIoU41.22FedSAM
Federated LearningCityscapes heterogeneousmIoU38.65FedAvg
Federated LearningLandmarks-User-160kAcc@1-1262Clients68.32FedASAM + SWA
Federated LearningLandmarks-User-160kAcc@1-1262Clients68.12FedSAM + SWA
Federated LearningLandmarks-User-160kAcc@1-1262Clients67.52FedAvg + SWA
Federated LearningLandmarks-User-160kAcc@1-1262Clients64.23FedASAM
Federated LearningLandmarks-User-160kAcc@1-1262Clients63.72FedSAM
Federated LearningLandmarks-User-160kAcc@1-1262Clients61.91FedAvg
Federated LearningCIFAR-100 (alpha=1000, 10 clients per round)ACC@1-100Clients54.97FedASAM
Federated LearningCIFAR-100 (alpha=1000, 10 clients per round)ACC@1-100Clients54.79FedASAM + SWA
Federated LearningCIFAR-100 (alpha=1000, 10 clients per round)ACC@1-100Clients53.67FedSAM + SWA
Federated LearningCIFAR-100 (alpha=1000, 10 clients per round)ACC@1-100Clients53.39FedSAM
Federated LearningCIFAR-100 (alpha=1000, 10 clients per round)ACC@1-100Clients50.25FedAvg
Federated LearningCIFAR-100 (alpha=1000, 20 clients per round)ACC@1-100Clients54.5FedASAM
Federated LearningCIFAR-100 (alpha=1000, 20 clients per round)ACC@1-100Clients54.36FedSAM + SWA
Federated LearningCIFAR-100 (alpha=1000, 20 clients per round)ACC@1-100Clients54.1FedASAM + SWA
Federated LearningCIFAR-100 (alpha=1000, 20 clients per round)ACC@1-100Clients53.97FedSAM
Federated LearningCIFAR-100 (alpha=1000, 20 clients per round)ACC@1-100Clients50.66FedAvg

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17