Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features

Nikfarjam, Azadeh; Sarker, Abeed; O’Connor, Karen; Ginn, Rachel; Gonzalez‐Hernandez, Graciela

doi:10.1093/jamia/ocu041

articleJournal of the American Medical Informatics AssociationMar 9, 2015HYBRID OA

Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features

ANAzadeh Nikfarjam ASAbeed Sarker KOKaren O’Connor RGRachel Ginn GGGraciela Gonzalez‐Hernandez

Arizona State University

PubMed

Indexed incrossrefpubmed

Abstract

Objective

Social media is becoming increasingly popular as a platform for sharing personal health-related information. This information can be utilized for public health monitoring tasks, particularly for pharmacovigilance, via the use of natural language processing (NLP) techniques. However, the language in social media is highly informal, and user-expressed medical concepts are often nontechnical, descriptive, and challenging to extract. There has been limited progress in addressing these challenges, and thus far, advanced machine learning-based NLP techniques have been underutilized. Our objective is to design a machine learning-based approach to extract mentions of adverse drug reactions (ADRs) from highly informal text in social media.

Methods

We introduce ADRMine, a machine learning-based concept extraction system that uses conditional random fields (CRFs). ADRMine utilizes a variety of features, including a novel feature for modeling words' semantic similarities. The similarities are modeled by clustering words based on unsupervised, pretrained word representation vectors (embeddings) generated from unlabeled user posts in social media using a deep learning technique.

Citation impact

557

total citations

FWCI: 37.86
Percentile: 100%
References: 86

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Word embedding
Social media
Artificial intelligence
Sequence labeling
Conditional random field
Pharmacovigilance
Machine learning

UN Sustainable Development Goals

Quality Education

No related works found for this paper.