Causal Estimation and Inference in Nonlinear Mendelian Randomization Studies
Abstract
Mendelian randomization (MR) is widely used to uncover causal relationships in the presence of unmeasured confounders. However, most existing MR methods presuppose linear causality, risking bias when the true relationships are nonlinear, which is a common empirical scenario. In this paper, we compared two prevalent instrumental variable techniques (the two-stage prediction method and the control function method) under both linear and nonlinear settings, and addressed key issues such as horizontal pleiotropy and violations of classical assumptions in control function method. Most notably, we proposed a flexible semiparametric approach that estimates the causal function without a priori specification, reducing the risk of model misspecification, and extended our methods to binary outcomes, broadening its applicability. For all approaches, we provided estimators, standard errors, and test statistics, to facilitate robust causal inference. Extensive numerical simulations demonstrated that our proposed methods exhibited both accuracy and robustness across diverse scenarios. Applying our methods to UK Biobank data uncovered significant nonlinear causal effects missed by linear MR approaches. We offer an R package implementation for broader and more convenient use.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.