Do you know what q-means?

Abstract

Clustering is one of the most important tools for analysis of large datasets, and perhaps the most popular clustering algorithm is Lloyd's algorithm for k-means. This algorithm takes n vectors V=[v1,…,vn]∈Rd× n and outputs k centroids c1,…,ck∈Rd; these partition the vectors into clusters based on which centroid is closest to a particular vector. We present a classical -k-means algorithm that performs an approximate version of one iteration of Lloyd's algorithm with time complexity O(\|V\|F2nk2d2(k + n)), exponentially improving the dependence on the data size n and matching that of the "q-means" quantum algorithm originally proposed by Kerenidis, Landman, Luongo, and Prakash (NeurIPS'19). Moreover, we propose an improved q-means quantum algorithm with time complexity O(\|V\|Fnk3/2d(k+d)(k + n)) that quadratically improves the runtime of our classical -k-means algorithm in several parameters. Our quantum algorithm does not rely on quantum linear algebra primitives of prior work, but instead only uses QRAM to prepare simple states based on the current iteration's clusters and multivariate quantum amplitude estimation. Finally, we provide classical and quantum query lower bounds, showing that our algorithms are optimal in most parameters.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…