Estimate dependence in medium dimensions, using ranks and sub-sampling

Abstract

It is well known that non-parametric methods suffer from the "curse of dimensionality". We propose here a new estimation method for a multivariate distribution, using sub-sampling and ranks, which seems not to suffer from this "curse". We prove that in case of independence, the uncertainty of the estimated distribution increases almost linearly w.r.t. the dimension, for dimensions around 6. Otherwise, a simulation study shows that if we use this estimation to build an independence test, the number of observations needed to obtain a given power increases linearly with the dimension. Finally, we give examples of a regression using this estimation: with 3000 observations, in dimension 5, with a markedly complicated dependence, the estimated distribution is graphically very similar to the real one.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…