Logarithmic Asymptotic Relations Between p-Values and Mutual Information
Abstract
We establish a precise connection between statistical significance in dependence testing and information-theoretic dependence as quantified by Shannon mutual information (MI). In the absence of prior distributional information, we consider a maximum-entropy model and show that the probability associated with the realization of a given magnitude of MI takes an exponential form, yielding a corresponding tail-probability interpretation of a p-value. In contingency tables with fixed marginal frequencies, we analyze Fisher's exact test and prove that its p-value PF satisfies a logarithmic asymptotic relation of the form MI=-(1/N) PF + O((N+1)/N) as the sample size N∞. These results clarify the role of MI as the exponential rate governing the asymptotic behavior of p-values in the settings studied here, and they enable principled comparisons of dependence across datasets with different sample sizes. We further discuss implications for combining evidence across studies via meta-analysis, allowing mutual information and its statistical significance to be integrated in a unified framework.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.