On a Probability Inequality for Order Statistics with Applications to Bootstrap, Conformal Prediction, and more

Abstract

``Behind every limit theorem, there is an inequality'' said Kolmogorov. We say ``for every inequality, there is an approximate inequality under approximate regularity conditions.'' Suppose X, X' are independent and identically distributed random variables. Then X X' with a probability of at least 1/2, irrespective of the underlying (common) distribution. One can ask what happens to the probability if X, X' are independent but not identically distributed. It should be approximately 1/2 if the distributions are approximately equal. Similarly, what if the random variables are dependent? It should, again, be approximately 1/2 if the random variables are approximately independent. We explore an extension of this probability inequality involving order statistics and develop approximate versions of such an inequality under violations of independence and identical distribution assumptions. We further show that this inequality can be used as a basis to prove asymptotic validity of bootstrap/subsampling, finite-sample validity of conformal prediction, permutation tests, and asymptotic validity of rank tests without group invariance. Specifically, in the context of resampling inference, our results can be seen as a finite-sample instantiation of some results by Peter Hall and yield an alternative ``cheap bootstrap'' that applies to high-dimensional data.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…