Gfastats: conversion, evaluation and manipulation of genome sequences using assembly graphs
Rockefeller University · University of Freiburg · +4 more institutions
Abstract
MOTIVATION: With the current pace at which reference genomes are being produced, the availability of tools that can reliably and efficiently generate genome assembly summary statistics has become critical. Additionally, with the emergence of new algorithms and data types, tools that can improve the quality of existing assemblies through automated and manual curation are required. RESULTS: We sought to address both these needs by developing gfastats, as part of the Vertebrate Genomes Project (VGP) effort to generate high-quality reference genomes at scale. Gfastats is a standalone tool to compute assembly summary statistics and manipulate assembly sequences in FASTA, FASTQ or GFA [.gz] format. Gfastats stores…
Citation impact
- FWCI
- 69.91
- Percentile
- 100%
- References
- 17
Authors
8Topics & keywords
- Computer science
- Workflow
- Software
- Source code
- Consistency (knowledge bases)
- Sequence assembly
- Visualization
- Graph