Generalized EXTRA stochastic gradient Langevin dynamics
Langevin algorithms are popular Markov chain Monte Carlo methods for Bayesian learning, particularly when the aim is to sample from the posterior distribution of a parametric model, given the input data and the prior distribution over the model parameters. Their stochastic versions, such as stochastic gradient Langevin dynamics (SGLD), allow iterative learning from randomly sampled mini-batches and therefore scale to large datasets. However, when data is decentralized across a network of agents subject to communication and privacy constraints, standard SGLD algorithms cannot be applied. Instead, we employ decentralized SGLD (DE-SGLD) algorithms, where Bayesian learning is performed collaboratively by a network of agents without sharing individual data. Nonetheless, existing DE-SGLD algorithms induce a bias at every agent that can negatively impact performance; this bias persists even when using full batches and is attributable to network effects. Motivated by the EXTRA algorithm and its generalizations for decentralized optimization, we propose the generalized EXTRA stochastic gradient Langevin dynamics, which eliminates this bias in the full-batch setting. Moreover, we show that, in the mini-batch setting, our algorithm provides performance bounds that significantly improve upon those of standard DE-SGLD algorithms in the literature. Our numerical results also demonstrate the efficiency of the proposed approach.
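To make the network-induced bias mentioned above concrete, the following is a minimal sketch of a standard DE-SGLD iteration on a toy problem. It is not the generalized EXTRA method proposed in the paper. It assumes the commonly used DE-SGLD form (average with neighbors through a doubly stochastic mixing matrix, take a local gradient step, add Gaussian noise) and hypothetical quadratic local potentials f_i(x) = (x - b_i)^2 / 2, so that the target posterior is Gaussian with mean equal to the average of the b_i. The names `de_sgld_step`, `stationary_means`, `simulate`, `W`, `b`, and `eta` are all illustrative assumptions.

```python
import numpy as np

def de_sgld_step(X, W, grads, eta, rng):
    """One DE-SGLD iteration (assumed standard form).

    X     : current iterates, one scalar per agent, shape (n,)
    W     : doubly stochastic mixing matrix of the network, shape (n, n)
    grads : local gradients evaluated at each agent's iterate, shape (n,)
    eta   : step size
    """
    noise = rng.standard_normal(X.shape)
    return W @ X - eta * grads + np.sqrt(2.0 * eta) * noise

def stationary_means(W, b, eta):
    """Per-agent stationary means for the toy potentials f_i(x) = (x - b_i)^2 / 2.

    Taking expectations of the linear recursion gives m = W m - eta (m - b),
    that is, ((1 + eta) I - W) m = eta b.
    """
    n = len(b)
    return eta * np.linalg.solve((1.0 + eta) * np.eye(n) - W, b)

def simulate(W, b, eta, n_iter, burn_in, seed=0):
    """Run full-batch DE-SGLD and return each agent's time-averaged iterate."""
    rng = np.random.default_rng(seed)
    X = np.zeros(len(b))
    total = np.zeros(len(b))
    for k in range(n_iter):
        grads = X - b  # exact (full-batch) local gradients of the toy potentials
        X = de_sgld_step(X, W, grads, eta, rng)
        if k >= burn_in:
            total += X
    return total / (n_iter - burn_in)

# Hypothetical 4-agent ring network with lazy Metropolis-style weights.
W = np.array([
    [0.50, 0.25, 0.00, 0.25],
    [0.25, 0.50, 0.25, 0.00],
    [0.00, 0.25, 0.50, 0.25],
    [0.25, 0.00, 0.25, 0.50],
])
b = np.array([-1.0, 0.0, 1.0, 2.0])  # heterogeneous local data summaries
eta = 0.1

means = stationary_means(W, b, eta)
print("target mean:", b.mean())
print("per-agent stationary means:", means)
```

In this toy setting the network average of the stationary means matches the target mean, but the individual agents' means differ from it even though exact gradients are used. This is one simple way to see the kind of full-batch bias that the abstract attributes to network effects.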
Download the pre-print from arXiv.org
Citation
@article{gurbuzbalaban2024,
author = {Gurbuzbalaban, Mert and Rafiqul Islam, Mohammad and Wang, Xiaoyu and Zhu, Lingjiong},
title = {Generalized {EXTRA} Stochastic Gradient {Langevin} Dynamics},
journal = {arXiv preprint},
date = {2024-12-02},
url = {https://mrislambd.github.io/publication/pub3/},
doi = {10.48550/arXiv.2412.01993},
langid = {en}
}