Mendelian randomization identifies proteins involved in neurodegenerative diseases
Belbasis L., Morris S., Van Duijn C., Bennett D., Walters R.
Proteins are involved in multiple biological functions. High-throughput technologies have allowed the measurement of thousands of proteins in population biobanks. In this study, we aimed to identify proteins related to Alzheimer’s disease (AD), Parkinson’s disease (PD), Multiple Sclerosis (MS) and Amyotrophic Lateral Sclerosis (ALS) by leveraging large-scale genetic and proteomic data. We performed a two-sample cis Mendelian randomization (MR) study by selecting instrumental variables for the abundance of over 2,700 proteins measured by either Olink or SomaScan platforms in plasma from the UK Biobank and the deCODE Health Study. We also used the latest publicly-available genome-wide association studies (GWAS) for the neurodegenerative diseases of interest. The potentially causal effect of proteins on neurodegenerative diseases was estimated based on the Wald ratio. We tested 13,377 protein–disease associations, identifying 169 associations which were statistically significant (5% false discovery rate). Evidence of co-localisation between plasma protein abundance and disease risk (posterior probability >0.80) was identified for 61 proteindisease pairs, leading to 50 unique protein-disease associations. Notably, twenty-three out of 50 protein–disease associations corresponded to genetic loci not previously reported by GWAS. The two-sample MR and co-localisation analysis also showed that APOE abundance in plasma was associated with three sub-cortical volumes (hippocampus, amygdala, and nucleus accumbens) and white matter hyper-intensities, whereas PILRA and PILRB abundance in plasma was associated with caudate nucleus volume. Our study provided a comprehensive assessment of the effect of the human proteome that is currently measurable through two different platforms on neurodegenerative diseases. The newly-associated proteins indicated the involvement of complement (C1S, C1R), microglia (SIRPA, SIGLEC9, PRSS8) and lysosomes (CLN5) in AD, interleukin-6 pathway (CTF1) in PD, lysosomes (TPP1), blood–brain barrier integrity (MFAP2) and astrocytes (TNFSF13) in ALS, and blood–brain barrier integrity (VEGFB), oligodendrocytes (PARP1), node of Ranvier and dorsal root ganglion (NCS1, FLRT3, CDH15), and innate immune system (CR1, AHSG, WARS) in MS. Our study demonstrates how harnessing large-scale genomic and proteomic data can yield novel insights into the role of plasma proteome in the pathogenesis of neurodegenerative diseases.