The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins
Icahn School of Medicine at Mount Sinai
Abstract
Genomics, epigenomics, transcriptomics, proteomics and metabolomics efforts rapidly generate a plethora of data on the activity and levels of biomolecules within mammalian cells. At the same time, curation projects that organize knowledge from the biomedical literature into online databases are expanding. Hence, there is a wealth of information about genes, proteins and their associations, with an urgent need for data integration to achieve better knowledge extraction and data reuse. For this purpose, we developed the Harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins from over 70 major online resources. We extracted, abstracted and organized data into…
Citation impact
- FWCI: 39.52
- Percentile: 100%
- References: 167
Authors (7)
- Andrew D. Rouillard (corresponding author)
Icahn School of Medicine at Mount Sinai
- Gregory W. Gundersen
Icahn School of Medicine at Mount Sinai
- Nicolas Fernandez
Icahn School of Medicine at Mount Sinai
- Zichen Wang
Icahn School of Medicine at Mount Sinai
- Caroline D. Monteiro
Icahn School of Medicine at Mount Sinai
Topics & keywords
- Upload
- Metadata
- Computer science
- Data integration
- Cluster analysis
- Computational biology
- Knowledge extraction
- Information retrieval