On the Structural Dimension of Sliced Inverse Regression

Abstract

In this work, we address the longstanding puzzle that Sliced Inverse Regression (SIR) often performs poorly for sufficient dimension reduction when the structural dimension d (the dimension of the central space) exceeds 4. We first show that in the multiple index model Y=f( P X)+ε where X is a p-standard normal vector, ε is an independent noise, and P is a projection operator from Rp to Rd, if the link function f follows the law of a Gaussian process, then with high probability, the d-th eigenvalue λd of Cov[E(X Y)] satisfies λd≤ C e-θ d for some positive constants C and θ. We then focus on the low signal regime where λd can be arbitrarily small and not larger than d-8.1, and prove that the minimax risk of estimating the central space is lower bounded by dpnλd. Combining these two results, we provide a convincing explanation for the poor performance of SIR when d is large, a phenomenon that has perplexed researchers for nearly three decades. The technical tools developed here may be of independent interest for studying other sufficient dimension reduction methods.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…