Distribution-Sensitive Bounds on Relative Approximations of Geometric Ranges
Abstract
A family R of ranges and a set X of points together define a range space (X, R|X), where R|X = \X h h ∈ R\. We want to find a structure to estimate the quantity |X h|/|X| for any range h ∈ R with the (, ε)-guarantee: (i) if |X h|/|X| > , the estimate must have a relative error ε; (ii) otherwise, the estimate must have an absolute error ε. The objective is to minimize the size of the structure. Currently, the dominant solution is to compute a relative (, ε)-approximation, which is a subset of X with O(λ/( ε2)) points, where λ is the VC-dimension of (X, R|X), and O hides polylog factors. This paper shows a more general bound sensitive to the content of X. We give a structure that stores O( (1/)) integers plus O(θ · (λ/ε2)) points of X, where θ - called the disagreement coefficient - measures how much the ranges differ from each other in their intersections with X. The value of θ is between 1 and 1/, such that our space bound is never worse than that of relative (, ε)-approximations, but we improve the latter's 1/ term whenever θ = o(1 (1/)). We also prove that, in the worst case, summaries with the (, 1/2)-guarantee must consume (θ) words even for d = 2 and λ 3. We then constrain R to be the set of halfspaces in Rd for a constant d, and prove the existence of structures with o(1/( ε2)) size offering (,ε)-guarantees, when X is generated from various stochastic distributions. This is the first formal justification on why the term 1/ is not compulsory for "realistic" inputs.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.