Evolutionary and methodological considerations when interpreting gene presence–absence variation in pangenomes
Lawrence Berkeley National Laboratory · Joint Genome Institute · +1 more institution
Abstract
While graph-based pangenomes have become a standard and interoperable foundation for comparisons across multiple reference genomes, integrating protein-coding gene annotations across pangenomes in a single 'pangene set' remains challenging, both because of methodological inconsistency and biological presence-absence variation (PAV). Here, we review and experimentally evaluate the root of genome annotation and pangene set inconsistency using two polyploid plant pangenomes: cotton and soybean, which were chosen because of their existing diverse high-quality genomic resources and the known importance of gene PAV in their respective breeding programs. We first demonstrate that building pangene sets across…
Citation impact
- FWCI
- 78.07
- Percentile
- 100%
- References
- 0
Authors
4- TBTomáš BrůnaCorresponding
Lawrence Berkeley National Laboratory, Joint Genome Institute
- ASAvinash Sreedasyam
Lawrence Berkeley National Laboratory, Joint Genome Institute, HudsonAlpha Institute for Biotechnology
- AMAvril M. Harder
HudsonAlpha Institute for Biotechnology
- JTJohn T Lovell
Lawrence Berkeley National Laboratory, Joint Genome Institute, HudsonAlpha Institute for Biotechnology
Topics & keywords
- Annotation
- Genome
- Gene Annotation
- Variation (astronomy)
- Gene
- Inference
- Structural variation
- Gene prediction
Funding
- UDU.S. Department of EnergyAwards: -AC02-05CH11231, 05CH11231, AC02-05CH11231, DE-AC02, DE-AC02-05CH11231, DE-AC02-
- UDU.S. Department of AgricultureAward: DE-AC02-05CH11231
- CICotton IncorporatedAward: 18-753
- JGJoint Genome InstituteAwards: DE-AC02-05CH11231, AC02-05CH11231
- NINational Institute of Food and Agriculture
- OOOffice of ScienceAwards: AC02-05CH11231, -AC02-05CH11231, DE-AC02