Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega
University College Dublin · Genome Institute of Singapore · +8 more institutions
Abstract
Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of the size of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high-quality alignments, but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and that delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the…
Citation impact
- FWCI
- 137.41
- Percentile
- 100%
- References
- 28
Authors
12Topics & keywords
- Bottleneck
- Multiple sequence alignment
- Scalability
- Sequence alignment
- Alignment-free sequence analysis
- Computer science
- Sequence (biology)
- Heuristic