On the Relation Between Identifiability, Differential Privacy and Mutual-Information Privacy
Abstract
This paper investigates the relation between three different notions of privacy: identifiability, differential privacy and mutual-information privacy. Under a unified privacy-distortion framework, where the distortion is defined to be the Hamming distance of the input and output databases, we establish some fundamental connections between these three privacy notions. Given a distortion level D, define εi*(D) to be the smallest (best) identifiability level, and εd*(D) to be the smallest differential privacy level. We characterize εi*(D) and εd*(D), and prove that εi*(D)-εXεd*(D)εi*(D) for D in some range, where εX is a constant depending on the distribution of the original database X, and diminishes to zero when the distribution of X is uniform. Furthermore, we show that identifiability and mutual-information privacy are consistent in the sense that given distortion level D, the mechanism that optimizes the mutual-information privacy also minimizes the identifiability level.