Low-degree Lower bounds for clustering in moderate dimension

Abstract

We study the fundamental problem of clustering n points into K groups drawn from a mixture of isotropic Gaussians in Rd. Specifically, we investigate the requisite minimal distance between mean vectors to partially recover the underlying partition. While the minimax-optimal threshold for is well-established, a significant gap exists between this information-theoretic limit and the performance of known polynomial-time procedures. Although this gap was recently characterized in the high-dimensional regime (n ≤ dK), it remains largely unexplored in the moderate-dimensional regime (n ≥ dK). In this manuscript, we address this regime by establishing a new low-degree polynomial lower bound for the moderate-dimensional case when d ≥ K. We show that while the difficulty of clustering for n ≤ dK is primarily driven by dimension reduction and spectral methods, the moderate-dimensional regime involves more delicate phenomena leading to a "non-parametric rate". We provide a novel non-spectral algorithm matching this rate, shedding new light on the computational limits of the clustering problem in moderate dimension.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…