Inferring Correlation Networks from Genomic Survey Data
Massachusetts Institute of Technology · Broad Institute
Abstract
High-throughput sequencing based techniques, such as 16S rRNA gene profiling, have the potential to elucidate the complex inner workings of natural microbial communities - be they from the world's oceans or the human gut. A key step in exploring such data is the identification of dependencies between members of these communities, which is commonly achieved by correlation analysis. However, it has been known since the days of Karl Pearson that the analysis of the type of data generated by such techniques (referred to as compositional data) can produce unreliable results since the observed data take the form of relative fractions of genes or species, rather than their absolute abundances. Using simulated and…
Citation impact
- FWCI
- 15.85
- Percentile
- 100%
- References
- 36
Authors
2Topics & keywords
- Spurious relationship
- Compositional data
- Human Microbiome Project
- Microbiome
- Profiling (computer programming)
- Correlation
- Human microbiome
- Identification (biology)
- Life below water
Funding
- UDU.S. Department of EnergyAwards: -AC02-05CH11231, Contract No. DE-AC02-05CH11231, 05CH11231, No. DE-AC02-05CH11231, AC02-05CH11231, DE-AC02, DE-AC02-05CH11231, DE-AC02-
- OOOffice of ScienceAwards: AC02-05CH11231, -AC02-05CH11231, DE-AC02, No. DE-AC02-05CH11231, Contract No. DE-AC02-05CH11231
- BABiological and Environmental ResearchAwards: 05CH11231, No. DE-AC02-05CH11231, Contract No. DE-AC02-05CH11231, DE-AC02-05CH11231, AC02-05CH11231
- LBLawrence Berkeley National LaboratoryAwards: DE-AC02-05CH11231, No. DE-AC02-05CH11231, Contract No. DE-AC02-05CH11231, 05CH11231, AC02-05CH11231