articleGenome biologyOct 16, 2020GOLD OA

The design and construction of reference pangenome graphs with minigraph

Harvard University · Dana-Farber Cancer Institute · +1 more institution

PubMed
Indexed incrossrefdoajpubmed

Abstract

The recent advances in sequencing technologies enable the assembly of individual genomes to the quality of the reference genome. How to integrate multiple genomes from the same species and make the integrated representation accessible to biologists remains an open challenge. Here, we propose a graph-based data model and associated formats to represent multiple genomes while preserving the coordinate of the linear reference genome. We implement our ideas in the minigraph toolkit and demonstrate that we can efficiently construct a pangenome graph and compactly encode tens of thousands of structural variants missing from the current reference genome.

Citation impact

496
total citations
FWCI
19.09
Percentile
100%
References
69
Citations per year

Authors

3

Topics & keywords

Keywords
  • ENCODE
  • Genome
  • Reference genome
  • Biology
  • Graph
  • Computational biology
  • Human genetics
  • Representation (politics)
No related works found for this paper.

Funding