Approximating The p-Mean Curve of Large Data-Sets

Abstract

A set of piecewise linear functions, called polylines, P1,…,PL each with at most n vertices can be simplified into a polyline M with k vertices, such that the Fr\'echet distances ε1,…,εL to each of these polylines are minimized under the Lp distance. We call M for Lp with p≥ 1 a p-mean curve (p-MC). We discuss p≥ 1, for which Lp distance satisfies the triangle inequality and p-mean has not been discussed before for most values p. Computing the p-mean polyline is NP-hard for L=(1) and some values of p, so we discuss approximation algorithms. We give a O(n2 k) time exact algorithm for L=2 and p≥ 1. Also, we reduce the Fr\'echet distance to the discrete Fr\'echet distance which adds a factor 2 to both k and ε. Then we use our exact algorithm to find a 3-approximation for L>2 in poly(n,L) time. Our method is based on a generalization of the free-space diagram (FSD) for Fr\'echet distance and composable core-sets for approximate summaries.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…