A family of Chatterjee's correlation coefficients and their properties

Abstract

Quantifying the strength of functional dependence between random scalars X and Y is an important statistical problem. While many existing correlation coefficients excel in identifying linear or monotone functional dependence, they fall short in capturing general non-monotone functional relationships. In response, we propose a family of correlation coefficients (h,F)n, characterized by a continuous bivariate function h and a cdf function F. By offering a range of selections for h and F, (h,F)n encompasses a diverse class of novel correlation coefficients, while also incorporates the Chatterjee's correlation coefficient (Chatterjee, 2021) as a special case. We prove that (h,F)n converges almost surely to a deterministic limit (h,F) as sample size n approaches infinity. In addition, under appropriate conditions imposed on h and F, the limit (h,F) satisfies the three appealing properties: (P1). it belongs to the range of [0,1]; (P2). it equals 1 if and only if Y is a measurable function of X; and (P3). it equals 0 if and only if Y is independent of X. As amplified by our numerical experiments, our proposals provide practitioners with a variety of options to choose the most suitable correlation coefficient tailored to their specific practical needs.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…