Distinguishing correlation from causation using genome-wide association studies
Abstract
Genome-wide association studies (GWAS) have emerged as a rich source of genetic clues into disease biology, and they have revealed strong genetic correlations among many diseases and traits. Some of these genetic correlations may reflect causal relationships. We developed a method to quantify causal relationships between genetically correlated traits using GWAS summary association statistics. In particular, our method quantifies what part of the genetic component of trait 1 is also causal for trait 2 using mixed fourth moments $E(\alpha_1^2 \alpha_1 \alpha_2)$ and $E(\alpha_2^2 \alpha_1 \alpha_2)$ of the bivariate effect size distribution. If trait 1 is causal for trait 2, then SNPs affecting trait 1 (large $\alpha_1^2$) will have correlated effects on trait 2 (large $\alpha_1 \alpha_2$), but not vice versa. We validated this approach in extensive simulations. Across 52 traits (average N=331k), we identified 30 putative genetically causal relationships, many novel, including an effect of LDL cholesterol on decreased bone mineral density. More broadly, we demonstrate that it is possible to distinguish between genetic correlation and causation using genetic association data.
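To make the asymmetry between the two mixed fourth moments concrete, the following is a minimal toy sketch, not the authors' estimator. It assumes we can observe true per-SNP effect sizes directly (real analyses work from noisy GWAS summary statistics and must account for linkage disequilibrium and sampling error), and it simulates a case where trait 1 is fully causal for trait 2. The function name `mixed_fourth_moments`, the sparsity level of 5%, and the causal effect of 0.5 are all illustrative assumptions.

```python
import numpy as np

def mixed_fourth_moments(a1, a2):
    """Return (E[a1^2 * a1*a2], E[a2^2 * a1*a2]) after scaling each
    effect-size vector to unit second moment (hypothetical helper)."""
    a1 = a1 / np.sqrt(np.mean(a1 ** 2))
    a2 = a2 / np.sqrt(np.mean(a2 ** 2))
    m1 = np.mean(a1 ** 2 * a1 * a2)  # weights SNPs by their trait 1 effect
    m2 = np.mean(a2 ** 2 * a1 * a2)  # weights SNPs by their trait 2 effect
    return m1, m2

rng = np.random.default_rng(0)
n_snps = 200_000

# Assumption: sparse effects, about 5% of SNPs affect trait 1.
alpha1 = np.where(rng.random(n_snps) < 0.05, rng.normal(size=n_snps), 0.0)

# Assumption: trait 2 has its own independent sparse effects.
own2 = np.where(rng.random(n_snps) < 0.05, rng.normal(size=n_snps), 0.0)

# Trait 1 is causal for trait 2: every trait 1 SNP also shifts trait 2.
alpha2 = 0.5 * alpha1 + own2

m1, m2 = mixed_fourth_moments(alpha1, alpha2)
print(f"E(a1^2 a1 a2) = {m1:.2f}, E(a2^2 a1 a2) = {m2:.2f}")
```

In this toy setup the first moment comes out larger than the second, because SNPs with large trait 1 effects always carry a proportional trait 2 effect, while many SNPs with large trait 2 effects have no trait 1 effect at all. The asymmetry depends on the effect-size distribution being non-Gaussian (here, sparse); for jointly Gaussian effects the two moments would coincide.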