Near-Optimal Coresets of Kernel Density Estimates

Abstract

We construct near-optimal coresets for kernel density estimates for points in Rd when the kernel is positive definite. Specifically we show a polynomial time construction for a coreset of size O(d/· 1/ ), and we show a near-matching lower bound of size (\d/, 1/2\). When d≥ 1/2, it is known that the size of coreset can be O(1/2). The upper bound is a polynomial-in-(1/) improvement when d ∈ [3,1/2) and the lower bound is the first known lower bound to depend on d for this problem. Moreover, the upper bound restriction that the kernel is positive definite is significant in that it applies to a wide-variety of kernels, specifically those most important for machine learning. This includes kernels for information distances and the sinc kernel which can be negative.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…