Article · Nature Communications · Mar 18, 2020 · Gold OA

GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes

Johns Hopkins University · SIB Swiss Institute of Bioinformatics · +2 more institutions

Indexed in: Crossref, DOAJ, PubMed

Abstract

An important assessment prior to genome assembly and related analyses is genome profiling, where the k-mer frequencies within raw sequencing reads are analyzed to estimate major genome characteristics such as size, heterozygosity, and repetitiveness. Here we introduce GenomeScope 2.0 (https://github.com/tbenavi1/genomescope2.0), which applies combinatorial theory to establish a detailed mathematical model of how k-mer frequencies are distributed in heterozygous and polyploid genomes. We describe and evaluate a practical implementation of the polyploid-aware mixture model that quickly and accurately infers genome properties across thousands of simulated and several real datasets spanning a broad range of…
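The k-mer frequency profile that the abstract refers to can be illustrated with a short Python sketch. This is not the GenomeScope 2.0 implementation: it is a minimal, assumed example that counts canonical k-mers in a handful of in-memory reads and builds the coverage histogram (how many distinct k-mers occur once, twice, and so on) that a profiling model would then be fitted to. The function names `canonical` and `kmer_spectrum` and the toy reads are hypothetical; real datasets would normally be processed with a dedicated k-mer counter rather than pure Python.

```python
from collections import Counter

# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer: str) -> str:
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def kmer_spectrum(reads, k):
    """Build a k-mer spectrum: occurrence count -> number of distinct k-mers.

    Hypothetical helper for illustration only; k-mers containing
    characters other than A, C, G, T are skipped.
    """
    counts = Counter()
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if set(kmer) <= set("ACGT"):
                counts[canonical(kmer)] += 1
    # Collapse per-k-mer counts into a histogram of counts.
    return Counter(counts.values())


# Toy input (assumed data, not from the paper).
reads = ["ACGTAC", "CGTACG"]
spectrum = kmer_spectrum(reads, 3)
for occurrences, n_kmers in sorted(spectrum.items()):
    print(occurrences, n_kmers)
```

Counting canonical k-mers merges each k-mer with its reverse complement, so reads from either strand contribute to the same entry. On real sequencing data the resulting histogram shows peaks at multiples of the sequencing coverage, and it is the shape of those peaks that a genome profiling model interprets.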

Citation impact

  • Total citations: 2,572
  • FWCI: 161.62
  • Percentile: 100%
  • References: 40

Authors

3

Topics & keywords

Keywords
  • Polyploid
  • Genome
  • Ploidy
  • Loss of heterozygosity
  • Biology
  • Computational biology
  • Profiling (computer programming)
  • Genome size

Funding