Fully-Dynamic Coresets

Abstract

With input sizes becoming massive, coresets -- small yet representative summary of the input -- are relevant more than ever. A weighted set Cw that is a subset of the input is an -coreset if the cost of any feasible solution S with respect to Cw is within [1 ] of the cost of S with respect to the original input. We give a very general technique to compute coresets in the fully-dynamic setting where input points can be added or deleted. Given a static -coreset algorithm that runs in time t(n, , λ) and computes a coreset of size s(n, , λ), where n is the number of input points and 1 -λ is the success probability, we give a fully-dynamic algorithm that computes an -coreset with worst-case update time O(( n) · t(s(n, / n, λ/n), / n, λ/n) ) (this bound is stated informally), where the success probability is 1-λ. Our technique is a fully-dynamic analog of the merge-and-reduce technique that applies to insertion-only setting. Although our space usage is O(n), we work in the presence of an adaptive adversary, and we show that (n) space is required when adversary is adaptive. As a consequence, we get fully-dynamic -coreset algorithms for k-median and k-means with worst-case update time O(-2k25 n 3 k) and coreset size O(-2k n 2 k) ignoring n and (1/) factors and assuming that , λ = (1/poly(n)). These are the first fully-dynamic algorithms for k-median and k-means with worst-case update times O(poly(k, n, -1)). We also give conditional lower bound on update/query time for any fully-dynamic (4 - δ)-approximation algorithm for k-means.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…