SINA: Accurate high-throughput multiple sequence alignment of ribosomal RNA genes
Constructor University · Max Planck Institute for Marine Microbiology
Abstract
MOTIVATION: In the analysis of homologous sequences, computation of multiple sequence alignments (MSAs) has become a bottleneck. This is especially troublesome for marker genes like the ribosomal RNA (rRNA) where already millions of sequences are publicly available and individual studies can easily produce hundreds of thousands of new sequences. Methods have been developed to cope with such numbers, but further improvements are needed to meet accuracy requirements. RESULTS: In this study, we present the SILVA Incremental Aligner (SINA) used to align the rRNA gene databases provided by the SILVA ribosomal RNA project. SINA uses a combination of k-mer searching and partial order alignment (POA) to maintain very…
Citation impact
- FWCI
- 65.83
- Percentile
- 100%
- References
- 31
Authors
3Topics & keywords
- Bottleneck
- Benchmark (surveying)
- Computer science
- Ribosomal RNA
- Computational biology
- Throughput
- Sequence (biology)
- Multiple sequence alignment