Trade-off Functions for DP-SGD with Subsampling based on Random Shuffling: Tight Upper and Lower Bounds
Abstract
We derive a tight analysis of the trade-off function for Differentially Private Stochastic Gradient Descent (DP-SGD) with subsampling based on random shuffling within the f-DP framework. Our analysis covers the regime σ≥ 3/ M, where σ is the noise multiplier and M is the number of rounds within a single epoch. Unlike f-DP analyses for Poisson subsampling, which yield non-closed implicit formulas that can be machine computed but are non-transparent, random shuffling admits a tight analysis yielding transparent and interpretable closed-form bounds. Our concrete bounds, derived via the Berry-Esseen theorem, are tight up to constant factors within the proof framework. We demonstrate worked parameter settings for a single epoch (E=1) with a corresponding trade-off function ≥ 1-a-δ, that is, only δ below the ideal random guessing diagonal 1-a: For δ= 1/100 and σ= 1, roughly M ≈ 1.14× 106 rounds and N ≈ 1.14× 107 training samples suffice to achieve meaningful differential privacy. This is in contrast to recent negative results for the regime σ≤ 1/2 M. Our concrete bounds can be composed over multiple epochs leading to δ having a linear in E dependency, which restricts E=O(M). To go beyond Berry--Esseen, we introduce a new proof technique based on a generalization of the law of large numbers that yields an asymptotic random guessing diagonal-limit result: if E=cM2M with cM 0, then the E-fold composed trade-off function satisfies f E(a) 1-a uniformly in a∈[0,1] with δ having only an O(E) dependency. We compare this asymptotic regime with the corresponding Poisson subsampling asymptotic, and highlight the characterization of explicit convergence rates as an open question.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.