InterProScan 5: genome-scale protein function classification
European Bioinformatics Institute · Wellcome Sanger Institute
Abstract
Abstract Motivation: Robust large-scale sequence analysis is a major challenge in modern genomic science, where biologists are frequently trying to characterize many millions of sequences. Here, we describe a new Java-based architecture for the widely used protein function prediction software package InterProScan. Developments include improvements and additions to the outputs of the software and the complete reimplementation of the software framework, resulting in a flexible and stable system that is able to use both multiprocessor machines and/or conventional clusters to achieve scalable distributed data analysis. InterProScan is freely available for download from the EMBl-EBI FTP site and the open source…
Citation impact
- FWCI
- 136.86
- Percentile
- 100%
- References
- 31
Authors
17- PJPhilip JonesCorresponding
European Bioinformatics Institute, Wellcome Sanger Institute
- DBDavid Binns
European Bioinformatics Institute, Wellcome Sanger Institute
- HCHsin-Yu Chang
European Bioinformatics Institute, Wellcome Sanger Institute
- MFMatthew Fraser
European Bioinformatics Institute, Wellcome Sanger Institute
- WLWeizhong Li
European Bioinformatics Institute, Wellcome Sanger Institute
Topics & keywords
- File Transfer Protocol
- Unix
- Computer science
- Java
- Operating system
- Scalability
- Source code
- Software