Javascript must be enabled to continue!
DP-AdamBC: Your DP-Adam Is Actually DP-SGD (Unless You Apply Bias Correction)
View through CrossRef
The Adam optimizer is a popular choice in contemporary deep learning due to its strong empirical performance. However we observe that in privacy sensitive scenarios, the traditional use of Differential Privacy (DP) with the Adam optimizer leads to sub-optimal performance on several tasks. We find that this performance degradation is due to a DP bias in Adam's second moment estimator, introduced by the addition of independent noise in the gradient computation to enforce DP guarantees. This DP bias leads to a different scaling for low variance parameter updates, that is inconsistent with the behavior of non-private Adam, and Adam's sign descent interpretation. We propose the DP-AdamBC optimization algorithm, which corrects for the bias in the second moment estimation and retrieves the expected behaviour of Adam. Empirically, DP-AdamBC significantly improves the optimization performance of DP-Adam by up to 3.5% in final accuracy in image, text, and graph node classification tasks.
Association for the Advancement of Artificial Intelligence (AAAI)
Title: DP-AdamBC: Your DP-Adam Is Actually DP-SGD (Unless You Apply Bias Correction)
Description:
The Adam optimizer is a popular choice in contemporary deep learning due to its strong empirical performance.
However we observe that in privacy sensitive scenarios, the traditional use of Differential Privacy (DP) with the Adam optimizer leads to sub-optimal performance on several tasks.
We find that this performance degradation is due to a DP bias in Adam's second moment estimator, introduced by the addition of independent noise in the gradient computation to enforce DP guarantees.
This DP bias leads to a different scaling for low variance parameter updates, that is inconsistent with the behavior of non-private Adam, and Adam's sign descent interpretation.
We propose the DP-AdamBC optimization algorithm, which corrects for the bias in the second moment estimation and retrieves the expected behaviour of Adam.
Empirically, DP-AdamBC significantly improves the optimization performance of DP-Adam by up to 3.
5% in final accuracy in image, text, and graph node classification tasks.
Related Results
Shaoyao Gancao decoction, an Ancient Classical Prescription: a review on its chemical composition, pharmacology, pharmacokinetics, clinical applications, and toxicology
Shaoyao Gancao decoction, an Ancient Classical Prescription: a review on its chemical composition, pharmacology, pharmacokinetics, clinical applications, and toxicology
Abstract
Objectives
Shaoyao Gancao decoction (SGD) is a famous Ancient Classical Prescription (ACP) from “Treatise on Febrile Di...
Tropical Indian Ocean Mixed Layer Bias in CMIP6 CGCMs Primarily Attributed tothe AGCM Surface Wind Bias
Tropical Indian Ocean Mixed Layer Bias in CMIP6 CGCMs Primarily Attributed tothe AGCM Surface Wind Bias
The relatively weak sea surface temperature bias in the tropical Indian Ocean (TIO) simulated in the coupledgeneral circulation model (CGCM) from the recently released CMIP6 has be...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract
The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Sustained upward discharge of relict high-salinity groundwater through salt marsh tidal creeks
Sustained upward discharge of relict high-salinity groundwater through salt marsh tidal creeks
Salt marshes are fine-grained ecosystems widely distributed in the intertidal zone along global coastlines. Recent studies proved that submarine groundwater discharge (SGD) in salt...
Effectiveness of YOLO Architectures in Tree Detection: Impact of Hyperparameter Tuning and SGD, Adam, and AdamW Optimizers
Effectiveness of YOLO Architectures in Tree Detection: Impact of Hyperparameter Tuning and SGD, Adam, and AdamW Optimizers
This study investigates the optimization of tree detection in static images using YOLOv5, YOLOv8, and YOLOv11 models, leveraging a custom non-standard image bank created exclusivel...
Spatiotemporal model based on transformer for bias correction and temporal downscaling of forecasts
Spatiotemporal model based on transformer for bias correction and temporal downscaling of forecasts
Numerical weather prediction (NWP) provides the future state of the atmosphere and is a major tool for weather forecasting. However, NWP has inevitable errors and requires bias cor...
Comparison of Inter‐Observer Bias between Low Resolution and High Resolution Scans using 3T and 7T Scanners
Comparison of Inter‐Observer Bias between Low Resolution and High Resolution Scans using 3T and 7T Scanners
IntroductionMRI can be used to assess atherosclerotic disease severity and to identify plaque components noninvasively. Vessel wall thickening can be measured with MRI and is assoc...
Propagation of biases in humidity in the estimation of global irrigation water
Propagation of biases in humidity in the estimation of global irrigation water
Abstract. Future projections on irrigation water under a changing climate are highly dependent on meteorological data derived from general circulation models (GCMs). Since climate ...

