Parameterized k-Clustering: The distance matters!
Abstract
We consider the k-Clustering problem, which is for a given multiset of n vectors X⊂ Zd and a nonnegative number D, to decide whether X can be partitioned into k clusters C1, …, Ck such that the cost \[Σi=1k ci∈ RdΣx ∈ Ci \|x-ci\|pp ≤ D,\] where \|·\|p is the Minkowski (Lp) norm of order p. For p=1, k-Clustering is the well-known k-Median. For p=2, the case of the Euclidean distance, k-Clustering is k-Means. We show that the parameterized complexity of k-Clustering strongly depends on the distance order p. In particular, we prove that for every p∈ (0,1], k-Clustering is solvable in time 2O(D D) (nd)O(1), and hence is fixed-parameter tractable when parameterized by D. On the other hand, we prove that for distances of orders p=0 and p=∞, no such algorithm exists, unless FPT=W[1].
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.