articleMethodsDec 5, 2014HYBRID OA

DISEASES: Text mining and data integration of disease–gene associations

Novo Nordisk Foundation · University of Copenhagen · +2 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Text mining is a flexible technology that can be applied to numerous different tasks in biology and medicine. We present a system for extracting disease-gene associations from biomedical abstracts. The system consists of a highly efficient dictionary-based tagger for named entity recognition of human genes and diseases, which we combine with a scoring scheme that takes into account co-occurrences both within and between sentences. We show that this approach is able to extract half of all manually curated associations with a false positive rate of only 0.16%. Nonetheless, text mining should not stand alone, but be combined with other types of evidence. For this reason, we have developed the DISEASES resource,…

No related works found for this paper.