Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases

Published on May 5, 2019 in bioRxiv 13.501
· DOI: 10.1101/627398
Jie Zheng
Estimated H-index: 21
(UoB: University of Bristol),
Valeriia Haberland
Estimated H-index: 8
(UoB: University of Bristol)
+ 31 Authors
Shan Luo
Estimated H-index: 7
(UoB: University of Bristol)
The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in protein levels. Here, we estimated the effects of 1002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization. Of 413 associations supported by evidence from MR, 139 (34%) were not supported by results of colocalization analyses, suggesting that genetic confounding may be widespread in naive phenome-wide association studies of proteins. Combining MR and colocalization evidence in cis-only analyses, we identified 105 putatively causal effects between 64 proteins and 51 downstream phenotypes. Evaluation of historic data from 268 drug development programmes showed that target-indication pairs with MR and colocalization support were considerably more likely to succeed, evidencing the value of this approach in identifying and prioritising potential therapeutic targets.
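As a rough illustration of the two-sample MR step mentioned in the abstract, the sketch below computes a Wald ratio estimate for a single genetic instrument, a common estimator when only one cis variant is available for a protein. It is not taken from the paper: the function name `wald_ratio` and the input numbers are hypothetical, and the standard error uses the first-order delta-method approximation, which ignores uncertainty in the variant-protein association.

```python
import math


def wald_ratio(beta_exposure, beta_outcome, se_outcome):
    """Wald ratio MR estimate from summary statistics for one variant.

    beta_exposure: variant effect on the exposure (e.g. protein level)
    beta_outcome:  variant effect on the outcome (e.g. disease log-odds)
    se_outcome:    standard error of beta_outcome
    """
    if beta_exposure == 0:
        raise ValueError("variant has no effect on the exposure; not a valid instrument")
    # Causal estimate: outcome effect per unit change in the exposure.
    estimate = beta_outcome / beta_exposure
    # First-order delta-method standard error (assumption: beta_exposure treated as fixed).
    se = se_outcome / abs(beta_exposure)
    # Two-sided p-value under a normal approximation.
    z = estimate / se
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return estimate, se, p_value


# Hypothetical summary statistics, for illustration only.
estimate, se, p_value = wald_ratio(beta_exposure=0.5, beta_outcome=0.1, se_outcome=0.02)
print(f"estimate={estimate:.3f}, se={se:.3f}, p={p_value:.2e}")
```

An MR estimate of this kind does not by itself rule out confounding through linkage disequilibrium, which is why the abstract pairs MR with colocalization before calling an effect putatively causal.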
Papers frequently viewed together
20 Authors (Ryan Langdon)
#1Michael Wainberg (Stanford University)H-Index: 15
#2Nasa Sinnott-Armstrong (Stanford University)H-Index: 16
Last. Anshul Kundaje (Stanford University)H-Index: 58
view all 15 authors...
Transcriptome-wide association studies (TWAS) integrate genome-wide association studies (GWAS) and gene expression datasets to identify gene–trait associations. In this Perspective, we explore properties of TWAS as a potential approach to prioritize causal genes at GWAS loci, by using simulations and case studies of literature-curated candidate causal genes for schizophrenia, low-density-lipoprotein cholesterol and Crohn's disease. We explore risk loci where TWAS accurately prioritizes the likel...
#1Alvaro N. Barbeira (U of C: University of Chicago)H-Index: 18
#2Milton Pividori (U of C: University of Chicago)H-Index: 9
Last. Hae Kyung Im (U of C: University of Chicago)H-Index: 35
view all 6 authors...
Integration of genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) studies is needed to improve our understanding of the biological mechanisms underlying GWAS hits, and our ability to identify therapeutic targets. Gene-level association methods such as PrediXcan can prioritize candidate targets. However, limited eQTL sample sizes and absence of relevant developmental and disease context restrict our ability to detect associations. Here we propose an efficient sta...
#1Terry Solomon (UCSD: University of California, San Diego)H-Index: 7
#2John D. Lapek (UM: University of Montana)H-Index: 15
Last. John-Bjarne Hansen (UNN: University Hospital of North Norway)H-Index: 149
view all 12 authors...
Background: Identifying genetic variation associated with plasma protein levels, and the mechanisms by which they act, could provide insight into alterable processes involved in regulation of prote...
#1Paul M. Ridker (Brigham and Women's Hospital)H-Index: 255
#2Peter Libby (Brigham and Women's Hospital)H-Index: 240
Last. Robert J. Glynn (Brigham and Women's Hospital)H-Index: 167
view all 10 authors...
Aims: Canakinumab, a monoclonal antibody targeting interleukin (IL)-1β, reduces rates of recurrent cardiovascular events without lowering lipids. It is uncertain, however, to what extent these beneficial cardiovascular outcomes are mediated through interleukin-6 (IL-6) signalling, an issue with substantial pathophysiologic consequences and therapeutic implications. Methods and results: A total of 4833 stable atherosclerosis patients in the Canakinumab Anti-Inflammatory Thrombosis Outcomes Study ...
#3Jie Zheng (UoB: University of Bristol)H-Index: 4
We have undertaken a systematic Mendelian randomization (MR) study using methylation quantitative trait loci (meQTL) as genetic instruments to assess the relationship between genetic variation, DNA methylation and 139 complex traits. Using two-sample MR, we identified 1148 associations across 61 traits where genetic variants were associated with both proximal DNA methylation (i.e. cis-meQTL) and complex trait variation (P < 1.39 × 10^-08). Joint likelihood mapping provided evidence that the genet...
#1Tianxi Cai (Harvard University)H-Index: 67
#2Yichi Zhang (Harvard University)H-Index: 3
Last. Jacqueline Honerlaw (Veterans Health Administration)H-Index: 9
view all 102 authors...
Importance: Electronic health record (EHR) biobanks containing clinical and genomic data on large numbers of individuals have great potential to inform drug discovery. Individuals with interleukin 6 receptor (IL6R) single-nucleotide polymorphisms (SNPs) who are not receiving IL6R blocking therapy have biomarker profiles similar to those treated with IL6R blockers. This gene–drug pair provides an example to test whether associations of IL6R SNPs with a broad range of phenotypes can inform which ...
#1Valur Emilsson (University of Iceland)H-Index: 52
#2Marjan IlkovH-Index: 10
Last. Vilmundur Gudnason (University of Iceland)H-Index: 159
view all 28 authors...
Proteins circulating in the blood are critical for age-related disease processes; however, the serum proteome has remained largely unexplored. To this end, 4137 proteins covering most predicted extracellular proteins were measured in the serum of 5457 Icelanders over 65 years of age. Pairwise correlation between proteins as they varied across individuals revealed 27 different network modules of serum proteins, many of which were associated with cardiovascular and metabolic disease states, as wel...
#1Jie Zheng (UoB: University of Bristol)H-Index: 4
Background: Identifying phenotypic correlations between complex traits and diseases can provide useful etiological insights. Restricted access to much individual-level phenotype data makes it difficult to estimate large-scale phenotypic correlation across the human phenome. Two state-of-the-art methods, metaCCA and LD score regression, provide an alternative approach to estimate phenotypic correlation using only genome-wide association study (GWAS) summary results. Results: Here, we present an i...
#1Gibran Hemani (UoB: University of Bristol)H-Index: 54
#2Jack Bowden (UoB: University of Bristol)H-Index: 43
Last. George Davey Smith (UoB: University of Bristol)H-Index: 246
view all 3 authors...
: Pleiotropy, the phenomenon of a single genetic variant influencing multiple traits, is likely widespread in the human genome. If pleiotropy arises because the single nucleotide polymorphism (SNP) influences one trait, which in turn influences another ('vertical pleiotropy'), then Mendelian randomization (MR) can be used to estimate the causal influence between the traits. Of prime focus among the many limitations to MR is the unprovable assumption that apparent pleiotropic associations are med...
#1Mee Ri Lee (SNU: Seoul National University)H-Index: 6
#2Youn-Hee Lim (SNU: Seoul National University)H-Index: 30
Last. Yun-Chul HongH-Index: 63
view all 3 authors...
Observational studies have shown that obesity is a major risk factor for hypertension, but unmeasured confounding factors may exist. We used Mendelian randomization (MR) to assess the causal effect of obesity on hypertension. The MR analysis was performed in a well-defined community cohort study of 8832 middle-aged (40–69 years) adults in Korea enrolled from 2001 to 2013. We used baseline hypertension and newly diagnosed hypertension during the 10-year follow-up period as the outcome variable. G...
Cited By 44
#1Josine Min (UoB: University of Bristol)H-Index: 7
#5Peter Walter (EMBL-EBI: European Bioinformatics Institute)H-Index: 4
Many gene expression quantitative trait locus (eQTL) studies have published their summary statistics, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and co-localization. However, technical differences between these datasets are a barrier to their widespread use. Consequently, target genes for most genome-wide association study (GWAS) signals have still not been identified. In the present study, we present the eQTL Catalogue ( null null nu...
#1Cindy G. Boer (EUR: Erasmus University Rotterdam)H-Index: 15
Last. Maris Teder-Laving (UT: University of Tartu)H-Index: 11
view all 155 authors...
Osteoarthritis affects over 300 million people worldwide. Here, we conduct a genome-wide association study meta-analysis across 826,690 individuals (177,517 with osteoarthritis) and identify 100 independently associated risk variants across 11 osteoarthritis phenotypes, 52 of which have not been associated with the disease before. We report thumb and spine osteoarthritis risk variants and identify differences in genetic effects between weight-bearing and non-weight-bearing joints. We identify se...
#1Mohd Anisul (Wellcome Trust Sanger Institute)H-Index: 1
#2Jarrod Shilts (Wellcome Trust Sanger Institute)H-Index: 4
Last. Miguel Carmona (EMBL-EBI: European Bioinformatics Institute)H-Index: 5
view all 0 authors...
Background: The virus SARS-CoV-2 can exploit biological vulnerabilities (e.g. host proteins) in susceptible hosts that predispose to the development of severe COVID-19. Methods: To identify host proteins that may contribute to the risk of severe COVID-19, we undertook proteome-wide genetic colocalisation tests, and polygenic (pan) and cis-Mendelian randomisation analyses leveraging publicly available protein and COVID-19 datasets. Results: Our analytic approach identified s...
#1Karin H. Nilsson (University of Gothenburg)H-Index: 8
#2Petra Henning (University of Gothenburg)H-Index: 26
Last. Juha Tuukkanen (University of Oulu)H-Index: 60
view all 21 authors...
With increasing age of the population, countries across the globe are facing a substantial increase in osteoporotic fractures. Genetic association signals for fractures have been reported at the RSPO3 locus, but the causal gene and the underlying mechanism are unknown. Here we show that the fracture reducing allele at the RSPO3 locus associate with increased RSPO3 expression both at the mRNA and protein levels, increased trabecular bone mineral density and reduced risk mainly of distal forearm f...
#1Valentina CiprianiH-Index: 16
#2Anna Tierney (University of Manchester)H-Index: 1
Last. Richard D. Unwin (University of Manchester)H-Index: 30
view all 10 authors...
Age-related macular degeneration (AMD) is a leading cause of vision loss; there is strong genetic susceptibility at the complement factor H (CFH) locus. This locus encodes a series of complement regulators: factor H (FH), a splice variant factor-H-like 1 (FHL-1), and five factor-H-related proteins (FHR-1 to FHR-5), all involved in the regulation of complement factor C3b turnover. Little is known about how AMD-associated variants at this locus might influence FHL-1 and FHR protein concentrations....
#1Z Y Yang (Li Ka Shing Faculty of Medicine, University of Hong Kong)H-Index: 6
#2Rong Yu (PKU: Peking University)H-Index: 1
Last. Weihu Wang (PKU: Peking University)H-Index: 2
view all 4 authors...
PD-1/PD-L1 might have a causal role in lung cancer risk. However, such an association has not been investigated in the general population. We assessed whether PD-L1 has an independent effect on lung cancer risk using two-sample Mendelian randomization (MR) based on a proteomic genome-wide association study (3301 healthy participants) of European ancestry and the International Lung Cancer Consortium (11,348 cases and 15,861 controls). Negative control analyses using chronic obstructive p...
#1Lucy J GoudswaardH-Index: 2
#2Joshua A. Bell (UoB: University of Bristol)H-Index: 19
Last. Willem H. OuwehandH-Index: 110
view all 15 authors...
BACKGROUND Variation in adiposity is associated with cardiometabolic disease outcomes, but mechanisms leading from this exposure to disease are unclear. This study aimed to estimate effects of body mass index (BMI) on an extensive set of circulating proteins. METHODS We used SomaLogic proteomic data from up to 2737 healthy participants from the INTERVAL study. Associations between self-reported BMI and 3622 unique plasma proteins were explored using linear regression. These were complemented by ...
#1Valur Emilsson (University of Iceland)H-Index: 52
Last. Vilmundur Gudnason (University of Iceland)H-Index: 159
view all 15 authors...
Abstract: Circulating proteins are prognostic for human outcomes including cancer, heart failure, brain trauma and brain amyloid plaque burden. A deep serum proteome survey recently revealed close associations of serum protein networks and common diseases. The present study reveals an unprecedented number of individual serum proteins that overlap genetic signatures of diseases emanating from different tissues of the body. Here, 54,469 low-frequency and common exome-array variants were compared ...
#1Yi Liu (UoB: University of Bristol)H-Index: 6
#2Benjamin Elsworth (UoB: University of Bristol)H-Index: 19
Last. Tom R. Gaunt (UoB: University of Bristol)H-Index: 70
view all 10 authors...
MOTIVATION The wealth of data resources on human phenotypes, risk factors, molecular traits and therapeutic interventions presents new opportunities for population health sciences. These opportunities are paralleled by a growing need for data integration, curation and mining to increase research efficiency, reduce mis-inference and ensure reproducible research. RESULTS We developed EpiGraphDB, a graph database containing an array of different biomedical and epidemiologi...