Article · PLoS Computational Biology · May 4, 2016 · Gold OA

Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes

Centre for Human Genetics · University of Oxford

Indexed in: Crossref, DOAJ, PubMed

Abstract

A central challenge in the analysis of genetic variation is to provide realistic genome simulation across millions of samples. Present-day coalescent simulations do not scale well, or use approximations that fail to capture important long-range linkage properties. Analysing the results of simulations also presents a substantial challenge, as current methods to store genealogies consume a great deal of space, are slow to parse, and do not take advantage of shared structure in correlated trees. We solve these problems by introducing sparse trees and coalescence records as the key units of genealogical analysis. Using these tools, exact simulation of the coalescent with recombination for chromosome-sized regions…
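To make the idea of coalescence records and sparse trees more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the class name `CoalescenceRecord`, its field names, the helper `tree_at`, and the toy data are all hypothetical. Each record is assumed to say that, over a genomic interval, a set of child nodes shares one parent node; a tree at a given position is then a sparse child-to-parent mapping built from the records covering that position.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class CoalescenceRecord:
    """Hypothetical record: `children` coalesce into `node` over [left, right)."""
    left: float                 # start of the genomic interval (inclusive)
    right: float                # end of the genomic interval (exclusive)
    node: int                   # parent (ancestral) node
    children: Tuple[int, ...]   # child nodes joined by this event
    time: float                 # time of the coalescence event


def tree_at(records: List[CoalescenceRecord], position: float) -> Dict[int, int]:
    """Return a sparse child -> parent map for the tree covering `position`."""
    parent: Dict[int, int] = {}
    for rec in records:
        if rec.left <= position < rec.right:
            for child in rec.children:
                parent[child] = rec.node
    return parent


# Toy data (assumed for illustration): samples 0, 1, 2 on the interval [0, 10),
# with the genealogy changing at position 5.
records = [
    CoalescenceRecord(0, 10, 3, (0, 1), 0.5),  # shared by both trees, stored once
    CoalescenceRecord(0, 5, 4, (2, 3), 1.0),   # root of the left-hand tree
    CoalescenceRecord(5, 10, 5, (2, 3), 1.5),  # root of the right-hand tree
]

left_tree = tree_at(records, 2.0)
right_tree = tree_at(records, 7.0)
```

In this toy example the record joining samples 0 and 1 spans the whole interval and is stored only once, even though it appears in both trees. That is the kind of shared structure between correlated trees that the abstract refers to.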

Citation impact

Total citations: 832
FWCI: 47.70
Percentile: 100%
References: 131

Authors

3

Topics & keywords

Keywords
  • Coalescent theory
  • Computer science
  • Coalescence (physics)
  • Parsing
  • Variation (astronomy)
  • Range (aeronautics)
  • Theoretical computer science
  • Biology

Funding