articleBioinformaticsJun 14, 2011BRONZE OA

PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions

Broad Institute · Massachusetts Institute of Technology

PubMed
Indexed incrossrefdoajpubmed

Abstract

MOTIVATION: As high-throughput transcriptome sequencing provides evidence for novel transcripts in many species, there is a renewed need for accurate methods to classify small genomic regions as protein coding or non-coding. We present PhyloCSF, a novel comparative genomics method that analyzes a multispecies nucleotide sequence alignment to determine whether it is likely to represent a conserved protein-coding region, based on a formal statistical comparison of phylogenetic codon models. RESULTS: We show that PhyloCSF's classification performance in 12-species Drosophila genome alignments exceeds all other methods we compared in a previous study. We anticipate that this method will be widely applicable as the…

Citation impact

1,075
total citations
FWCI
15.56
Percentile
100%
References
42
Citations per year

Authors

3

Topics & keywords

Keywords
  • ENCODE
  • Computational biology
  • Biology
  • Genome
  • Comparative genomics
  • Genomics
  • Phylogenetic tree
  • Coding region
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.

Funding