Additive Approximation Schemes for Low-Dimensional Embeddings

Abstract

We consider the task of fitting low-dimensional embeddings to high-dimensional data. In particular, we study the k-Euclidean Metric Violation problem (k-EMV), where the input is D ∈ Rn2≥ 0 and the goal is to find the closest vector X ∈ Mk, where Mk ⊂ Rn2≥ 0 is the set of all k-dimensional Euclidean metrics on n points, and closeness is formulated as the following optimization problem, where \| · \| is the entry-wise 2 norm: \[ OPTEMV = X ∈ Mk D - X 22\,.\] Cayton and Dasgupta [CD'06] showed that this problem is NP-Hard, even when k=1. Dhamdhere [Dha'04] obtained a O((n))-approximation for 1-EMV and leaves finding a PTAS for it as an open question (reiterated recently by Lee [Lee'25]). Although k-EMV has been studied in the statistics community for over 70 years, under the name "multi-dimensional scaling", there are no known efficient approximation algorithms for k > 1, to the best of our knowledge. We provide the first polynomial-time additive approximation scheme for k-EMV. In particular, we obtain an embedding with objective value OPTEMV + D22 in (n· B)poly(k, -1) time, where each entry in D can be represented by B bits. We believe our algorithm is a crucial first step towards obtaining a PTAS for k-EMV. Our key technical contribution is a new analysis of correlation rounding for Sherali-Adams / Sum-of-Squares relaxations, tailored to low-dimensional embeddings. We also show that our techniques allow us to obtain additive approximation schemes for two related problems: a weighted variant of k-EMV and p low-rank approximation for p>2.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…