articleNucleic Acids ResearchNov 27, 2005GOLD OA

Gene identification in novel eukaryotic genomes by self-training algorithm

Georgia Institute of Technology

PubMed
Indexed incrossrefdoajpubmed

Abstract

Finding new protein-coding genes is one of the most important goals of eukaryotic genome sequencing projects. However, genomic organization of novel eukaryotic genomes is diverse and ab initio gene finding tools tuned up for previously studied species are rarely suitable for efficacious gene hunting in DNA sequences of a new genome. Gene identification methods based on cDNA and expressed sequence tag (EST) mapping to genomic DNA or those using alignments to closely related genomes rely either on existence of abundant cDNA and EST data and/or availability on reference genomes. Conventional statistical ab initio methods require large training sets of validated genes for estimating gene model parameters. In…

Citation impact

1,079
total citations
FWCI
4.80
Percentile
100%
References
47
Citations per year

Authors

1

Topics & keywords

Keywords
  • Genome
  • Biology
  • Gene prediction
  • Computational biology
  • Gene
  • DNA sequencing
  • Genetics
  • Coding region
No related works found for this paper.

Funding