Separating k-Player from t-Player One-Way Communication, with Applications to Data Streams
Abstract
In a k-party communication problem, the k players with inputs x1, x2, …, xk, respectively, want to evaluate a function f(x1, x2, …, xk) using as little communication as possible. We consider the message-passing model, in which the inputs are partitioned in an arbitrary, possibly worst-case manner, among a smaller number t of players (t<k). The t-player communication cost of computing f can only be smaller than the k-player communication cost, since the t players can trivially simulate the k-player protocol. But how much smaller can it be? We study deterministic and randomized protocols in the one-way model, and provide separations for product input distributions, which are optimal for low error probability protocols. We also provide much stronger separations when the input distribution is non-product. A key application of our results is in proving lower bounds for data stream algorithms. In particular, we give an optimal (ε-2(N) (mM)) bits of space lower bound for the fundamental problem of (1ε)-approximating the number \|x\|0 of non-zero entries of an n-dimensional vector x after m integer updates each of magnitude at most M, and with success probability 2/3, in a strict turnstile stream. We additionally prove the matching (ε-2(N) (T)) space lower bound for the problem when we have access to a heavy hitters oracle with threshold T. Our results match the best known upper bounds when ε 1/polylog(mM) and when T = 2poly(1/ε) respectively. It also improves on the prior (ε-2(mM) ) lower bound and separates the complexity of approximating L0 from approximating the p-norm Lp for p bounded away from 0, since the latter has an O(ε-2 (mM)) bit upper bound.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.