Article · Nucleic Acids Research · Nov 26, 2015 · Gold OA

The Dfam database of repetitive DNA families

Institute for Systems Biology · European Bioinformatics Institute · +6 more institutions

Indexed in Crossref, DataCite, DOAJ, PubMed

Abstract

Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms…

Funding