Density Ratio-based Causal Discovery from Bivariate Continuous-Discrete Data

Abstract

We address the problem of inferring the causal direction between a continuous variable X and a discrete variable Y from observational data. For the model X Y, we adopt the threshold model used in prior work. For the model Y X, we consider two cases: (1) the conditional distributions of X given different values of Y form a location-shift family, and (2) they are mixtures of generalized normal distributions with independently parameterized components. We establish identifiability of the causal direction through three theoretical results. First, we prove that under X Y, the density ratio of X conditioned on different values of Y is monotonic. Second, we establish that under Y X with non-location-shift conditionals, monotonicity of the density ratio holds only on a set of Lebesgue measure zero in the parameter space. Third, we show that under X Y, the conditional distributions forming a location-shift family requires a precise coordination between the causal mechanism and input distribution, which is non-generic under the principle of independent mechanisms. Together, these results imply that monotonicity of the density ratio characterizes the direction X Y, whereas non-monotonicity or location-shift conditionals characterizes Y X. Based on this, we propose Density Ratio-based Causal Discovery (DRCD), a method that determines causal direction by testing for location-shift conditionals and monotonicity of the estimated density ratio. Experiments on synthetic and real-world datasets demonstrate that DRCD outperforms existing methods.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…