Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

DP-AdamBC: Your DP-Adam Is Actually DP-SGD (Unless You Apply Bias Correction)

View through CrossRef
The Adam optimizer is a popular choice in contemporary deep learning due to its strong empirical performance. However we observe that in privacy sensitive scenarios, the traditional use of Differential Privacy (DP) with the Adam optimizer leads to sub-optimal performance on several tasks. We find that this performance degradation is due to a DP bias in Adam's second moment estimator, introduced by the addition of independent noise in the gradient computation to enforce DP guarantees. This DP bias leads to a different scaling for low variance parameter updates, that is inconsistent with the behavior of non-private Adam, and Adam's sign descent interpretation. We propose the DP-AdamBC optimization algorithm, which corrects for the bias in the second moment estimation and retrieves the expected behaviour of Adam. Empirically, DP-AdamBC significantly improves the optimization performance of DP-Adam by up to 3.5% in final accuracy in image, text, and graph node classification tasks.
Title: DP-AdamBC: Your DP-Adam Is Actually DP-SGD (Unless You Apply Bias Correction)
Description:
The Adam optimizer is a popular choice in contemporary deep learning due to its strong empirical performance.
However we observe that in privacy sensitive scenarios, the traditional use of Differential Privacy (DP) with the Adam optimizer leads to sub-optimal performance on several tasks.
We find that this performance degradation is due to a DP bias in Adam's second moment estimator, introduced by the addition of independent noise in the gradient computation to enforce DP guarantees.
This DP bias leads to a different scaling for low variance parameter updates, that is inconsistent with the behavior of non-private Adam, and Adam's sign descent interpretation.
We propose the DP-AdamBC optimization algorithm, which corrects for the bias in the second moment estimation and retrieves the expected behaviour of Adam.
Empirically, DP-AdamBC significantly improves the optimization performance of DP-Adam by up to 3.
5% in final accuracy in image, text, and graph node classification tasks.

Related Results

Quantification of Submarine Groundwater Discharge in the Gaza Strip
Quantification of Submarine Groundwater Discharge in the Gaza Strip
Gaza Strip has suffered from seawater intrusion during the past three decades due to low rainfall and high abstraction from the groundwater resource. On a yearly basis, more than 1...
Tropical Indian Ocean Mixed Layer Bias in CMIP6 CGCMs Primarily Attributed tothe AGCM Surface Wind Bias
Tropical Indian Ocean Mixed Layer Bias in CMIP6 CGCMs Primarily Attributed tothe AGCM Surface Wind Bias
The relatively weak sea surface temperature bias in the tropical Indian Ocean (TIO) simulated in the coupledgeneral circulation model (CGCM) from the recently released CMIP6 has be...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Sustained upward discharge of relict high-salinity groundwater through salt marsh tidal creeks
Sustained upward discharge of relict high-salinity groundwater through salt marsh tidal creeks
Salt marshes are fine-grained ecosystems widely distributed in the intertidal zone along global coastlines. Recent studies proved that submarine groundwater discharge (SGD) in salt...
Spatiotemporal model based on transformer for bias correction and temporal downscaling of forecasts
Spatiotemporal model based on transformer for bias correction and temporal downscaling of forecasts
Numerical weather prediction (NWP) provides the future state of the atmosphere and is a major tool for weather forecasting. However, NWP has inevitable errors and requires bias cor...
Effectiveness of YOLO Architectures in Tree Detection: Impact of Hyperparameter Tuning and SGD, Adam, and AdamW Optimizers
Effectiveness of YOLO Architectures in Tree Detection: Impact of Hyperparameter Tuning and SGD, Adam, and AdamW Optimizers
This study investigates the optimization of tree detection in static images using YOLOv5, YOLOv8, and YOLOv11 models, leveraging a custom non-standard image bank created exclusivel...
Comparison of Inter‐Observer Bias between Low Resolution and High Resolution Scans using 3T and 7T Scanners
Comparison of Inter‐Observer Bias between Low Resolution and High Resolution Scans using 3T and 7T Scanners
IntroductionMRI can be used to assess atherosclerotic disease severity and to identify plaque components noninvasively. Vessel wall thickening can be measured with MRI and is assoc...

Back to Top