Genomes on a Tree (GoaT): A versatile, scalable search engine for genomic and sequencing project metadata across the eukaryotic tree of life
Abstract
As genomic data transform our understanding of biodiversity, the Earth BioGenome Project (EBP) has set a goal of generating reference quality genome assemblies for all ~1.9 million described eukaryotic taxa. Meeting this goal requires coordination among many individual regional and taxon-focussed projects working under the EBP umbrella. Large-scale sequencing projects require ready access to validated genome-relevant metadata, such as genome sizes and karyotypes, but these data are dispersed across the literature, and directly measured values are lacking for most taxa. To meet these needs, we have developed Genomes on a Tree (GoaT), an Elasticsearch-powered datastore and search index for genome-relevant…
Citation impact
- FWCI: 90.27
- Percentile: 100%
- References: 22
Authors
- Richard Challis (corresponding author)
Wellcome Sanger Institute
- Sujai Kumar
Wellcome Sanger Institute
- Cibele G. Sotero-Caio
Wellcome Sanger Institute
- Max R. Brown
Wellcome Sanger Institute
- Mark Blaxter
Wellcome Sanger Institute
Topics & keywords
- Metadata
- Genome
- Phylogenetic tree
- Tree (set theory)
- Scalability
- Computer science
- Interface (matter)
- Information retrieval
- Life in Land