How to Find a Joint Probability Distribution of Minimum Entropy (almost) given the Marginals
Abstract
Given two discrete random variables X and Y, with probability distributions p =(p1, … , pn) and q=(q1, … , qm), respectively, denote by C( p, q) the set of all couplings of p and q, that is, the set of all bivariate probability distributions that have p and q as marginals. In this paper, we study the problem of finding the joint probability distribution in C( p, q) of minimum entropy (equivalently, the joint probability distribution that maximizes the mutual information between X and Y), and we discuss several situations where the need for this kind of optimization naturally arises. Since the optimization problem is known to be NP-hard, we give an efficient algorithm to find a joint probability distribution in C( p, q) with entropy exceeding the minimum possible by at most 1, thus providing an approximation algorithm with additive approximation factor of 1. Leveraging on this algorithm, we extend our result to the problem of finding a minimum--entropy joint distribution of arbitrary k≥ 2 discrete random variables X1, … , Xk, consistent with the known k marginal distributions of X1, … , Xk. In this case, our approximation algorithm has an additive approximation factor of k. We also discuss some related applications of our findings.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.