Preprint | Wellcome Open Research | Jan 17, 2023 | Gold OA

Genomes on a Tree (GoaT): A versatile, scalable search engine for genomic and sequencing project metadata across the eukaryotic tree of life

Richard Challis, Sujai Kumar, Cibele G. Sotero-Caio, Max R. Brown, Mark Blaxter

Wellcome Sanger Institute

Indexed in: Crossref, DOAJ, PubMed

Abstract

As genomic data transform our understanding of biodiversity, the Earth BioGenome Project (EBP) has set a goal of generating reference quality genome assemblies for all ~1.9 million described eukaryotic taxa. Meeting this goal requires coordination among many individual regional and taxon-focussed projects working under the EBP umbrella. Large-scale sequencing projects require ready access to validated genome-relevant metadata, such as genome sizes and karyotypes, but these data are dispersed across the literature, and directly measured values are lacking for most taxa. To meet these needs, we have developed Genomes on a Tree (GoaT), an Elasticsearch-powered datastore and search index for genome-relevant…

Citation impact

Total citations: 622
FWCI: 90.27
Percentile: 100%
References: 22
Authors

5

Topics & keywords

Keywords
  • Metadata
  • Genome
  • Phylogenetic tree
  • Tree (set theory)
  • Scalability
  • Computer science
  • Interface (matter)
  • Information retrieval
UN Sustainable Development Goals
  • Life on Land

Funding