DANN: a deep learning approach for annotating the pathogenicity of genetic variants
University of California, Irvine · Irvine University
Abstract
UNLABELLED: Annotating genetic variants, especially non-coding variants, for the purpose of identifying pathogenic variants remains a challenge. Combined annotation-dependent depletion (CADD) is an algorithm designed to annotate both coding and non-coding variants, and has been shown to outperform other annotation algorithms. CADD trains a linear kernel support vector machine (SVM) to differentiate evolutionarily derived, likely benign, alleles from simulated, likely deleterious, variants. However, SVMs cannot capture non-linear relationships among the features, which can limit performance. To address this issue, we have developed DANN. DANN uses the same feature set and training data as CADD to train a deep…
Citation impact
- FWCI
- 22.79
- Percentile
- 100%
- References
- 11
Authors
3Topics & keywords
- Computer science
- Artificial intelligence
- Support vector machine
- Dropout (neural networks)
- Annotation
- Machine learning
- Source code
- Artificial neural network