Bayesian methods for meta-analysis of causal relationships estimated using genetic instrumental variables.
Statistics in medicine 2010 ; 29: 1298-311.
Burgess S, Thompson SG, CRP CHD Genetics Collaboration CRP CHD Genetics Collaboration, Andrews G, Samani NJ, Hall A, Whincup P, Morris R, Lawlor DA, Davey Smith G, Timpson N, Ebrahim S, Ben-Shlomo Y, Brown M, Ricketts S, Sandhu M, Reiner A, Psaty B, Lange L, Cushman M, Hung J, Thompson P, Beilby J, Warrington N, Palmer LJ, Nordestgaard BG, Tybjaerg-Hansen A, Zacho J, Wu C, Lowe G, Tzoulaki I, Kumari M, Yamamoto JF, Chiodini B, Franzosi M, Hankey GJ, Jamrozik K, Palmer L, Rimm E, Pai J, Heckbert S, Bis J, Anand S, Engert J, Collins R, Clarke R, Melander O, Berglund G, Ladenvall P, Johansson L, Jansson JH, Hallmans G, Hingorani A, Humphries S, Manson J, Watkins H, Hopewell J, Saleheen D, Frossard R, Danesh J, Sattar N, Robertson M, Shepherd J, Schaefer E, Hofman A, Witteman JC, Kardys I, de Faire U, Bennet A, Ford I, Packard C, Casas JP, Smeeth L, Wensley F, Bowden J, Di Angelantonio E, Gao P, Shah T, Verzilli C, Walker M, Whittaker J
DOI : 10.1002/sim.3843
PubMed ID : 20209660
PMCID : PMC3648673
URL : https://onlinelibrary.wiley.com/doi/abs/10.1002/sim.3843
Abstract
Genetic markers can be used as instrumental variables, in an analogous way to randomization in a clinical trial, to estimate the causal relationship between a phenotype and an outcome variable. Our purpose is to extend the existing methods for such Mendelian randomization studies to the context of multiple genetic markers measured in multiple studies, based on the analysis of individual participant data. First, for a single genetic marker in one study, we show that the usual ratio of coefficients approach can be reformulated as a regression with heterogeneous error in the explanatory variable. This can be implemented using a Bayesian approach, which is next extended to include multiple genetic markers. We then propose a hierarchical model for undertaking a meta-analysis of multiple studies, in which it is not necessary that the same genetic markers are measured in each study. This provides an overall estimate of the causal relationship between the phenotype and the outcome, and an assessment of its heterogeneity across studies. As an example, we estimate the causal relationship of blood concentrations of C-reactive protein on fibrinogen levels using data from 11 studies. These methods provide a flexible framework for efficient estimation of causal relationships derived from multiple studies. Issues discussed include weak instrument bias, analysis of binary outcome data such as disease risk, missing genetic data, and the use of haplotypes.