SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation
Army Medical University · Southwest Hospital
Abstract
FASTA and FASTQ are basic and ubiquitous formats for storing nucleotide and protein sequences. Common manipulations of FASTA/Q file include converting, searching, filtering, deduplication, splitting, shuffling, and sampling. Existing tools only implement some of these manipulations, and not particularly efficiently, and some are only available for certain operating systems. Furthermore, the complicated installation process of required packages and running environments can render these programs less user friendly. This paper describes a cross-platform ultrafast comprehensive toolkit for FASTA/Q processing. SeqKit provides executable binary files for all major operating systems, including Windows, Linux, and Mac…
Citation impact
- FWCI
- 14.97
- Percentile
- 100%
- References
- 7
Authors
4Topics & keywords
- Executable
- Computer science
- Usability
- Operating system
- Process (computing)
- Software
- Shuffling
- Mac OS