The MRC IEU OpenGWAS data infrastructure
University of Bristol · National Institute for Health and Care Research · +2 more institutions
Abstract
Abstract Data generated by genome-wide association studies (GWAS) are growing fast with the linkage of biobank samples to health records, and expanding capture of high-dimensional molecular phenotypes. However the utility of these efforts can only be fully realised if their complete results are collected from their heterogeneous sources and formats, harmonised and made programmatically accessible. Here we present the OpenGWAS database, an open source, open access, scalable and high-performance cloud-based data infrastructure that imports and publishes complete GWAS summary datasets and metadata for the scientific community. Our import pipeline harmonises these datasets against dbSNP and the human genome…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 36
Authors
14Topics & keywords
- Python (programming language)
- Computer science
- Genome-wide association study
- Biobank
- Data mapping
- Metadata
- Data mining
- Database
- Industry, innovation and infrastructure
Funding
- WTWellcome TrustAwards: 209739/Z/17/Z, Z/17/Z, 208806/Z/17/Z
- UHUniversity Hospitals Bristol NHS Foundation Trust
- CRCancer Research UKAwards: MC_UU_00011/1, C18281/A19169, A19169, C18281, C52724/A20138, MC_UU_00011/4
- NINational Institute for Health and Care ResearchAwards: C18281/A19169, MC_UU_00011/4, MC_UU_00011/1
- BHBritish Heart FoundationAwards: MC_UU_00011/1, AA/18/7/34219
- UOUniversity of BristolAwards: MC_UU_00011/4, C18281/A19169, MC_UU_00011/1
- MRMedical Research CouncilAwards: MC_UU_00011/4, MC_UU_00011/1, MC_UU_00011/1, MC_UU_00011