Proteinortho: Detection of (Co-)orthologs in large-scale analysis

Lechner, Marcus; Findeiß, Sven; Steiner, Lydia; Marz, Manja; Stadler, Peter F.; Prohaska, Sonja J.

doi:10.1186/1471-2105-12-124

articleBMC BioinformaticsApr 28, 2011GOLD OA

Proteinortho: Detection of (Co-)orthologs in large-scale analysis

MLMarcus Lechner SFSven Findeiß LSLydia Steiner MMManja Marz PFPeter F. Stadler

Philipps University of Marburg · Leipzig University · +5 more institutions

PubMed

Indexed incrossrefdoajpubmed

Abstract

Background

Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases.

Results

The program Proteinortho described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply Proteinortho to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes.

Citation impact

1,281

total citations

FWCI: 10.51
Percentile: 100%
References: 37

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Computer science
Genome
Set (abstract data type)
Software
Genomics
Proteome
Computational biology
Bottleneck

No related works found for this paper.

Funding

DF
Deutsche Forschungsgemeinschaft
Award: SPP 1258