A near-linear time approximation scheme for (k,)-median clustering under discrete Fr\'echet distance

Abstract

A time series of complexity m is a sequence of m real valued measurements. The discrete Fr\'echet distance ddF(x,y) is a distance measure between two time series x and y of possibly different complexity. Given a set of n time series represented as m-dimensional vectors over the reals, the (k,)-median problem under discrete Fr\'echet distance aims to find a set C of k time series of complexity such that Σx∈ P c∈ C ddF(x,c) is minimized. In this paper, we give the first near-linear time (1+)-approximation algorithm for this problem when and are constants but k can be as large as (n). We obtain our result by introducing a new dimension reduction technique for discrete Fr\'echet distance and then adapt an algorithm of Cohen-Addad et al. (J. ACM 2021) to work on the dimension-reduced input. As a byproduct we also improve the best coreset construction for (k,)-median under discrete Fr\'echet distance (Cohen-Addad et al., SODA 2025) and show that its size can be independent of the number of input time series and their complexity.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…