Private Geometric Median in Nearly-Linear Time

Abstract

Estimating the geometric median of a dataset is a robust counterpart to mean estimation, and is a fundamental problem in computational geometry. Recently, [HSU24] gave an (, δ)-differentially private algorithm obtaining an α-multiplicative approximation to the geometric median objective, 1 n Σi ∈ [n] \|· - xi\|, given a dataset D := \xi\i ∈ [n] ⊂ Rd. Their algorithm requires n d · 1 α samples, which they prove is information-theoretically optimal. This result is surprising because its error scales with the effective radius of D (i.e., of a ball capturing most points), rather than the worst-case radius. We give an improved algorithm that obtains the same approximation quality, also using n d · 1 αε samples, but in time O(nd + d α2). Our runtime is nearly-linear, plus the cost of the cheapest non-private first-order method due to [CLM+16]. To achieve our results, we use subsampling and geometric aggregation tools inspired by FriendlyCore [TCK+22] to speed up the "warm start" component of the [HSU24] algorithm, combined with a careful custom analysis of DP-SGD's sensitivity for the geometric median objective.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…