Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency

Ren, Shuhuai; Deng, Yihe; He, Kun; Che, Wanxiang

doi:10.18653/v1/p19-1103

articleJan 1, 2019GOLD OA

Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency

SRShuhuai Ren YDYihe Deng KHKun He WCWanxiang Che

University of California, Los Angeles · Huazhong University of Science and Technology · +1 more institution

Indexed incrossref

Abstract

We address the problem of adversarial attacks on text classification, which is rarely studied comparing to attacks on image classification. The challenge of this task is to generate adversarial examples that maintain lexical correctness, grammatical correctness and semantic similarity. Based on the synonyms substitution strategy, we introduce a new word replacement order determined by both the word saliency and the classification probability, and propose a greedy algorithm called probability weighted word saliency (PWWS) for text adversarial attack. Experiments on three popular datasets using convolutional as well as LSTM models show that PWWS reduces the classification accuracy to the most extent, and keeps a…

Citation impact

659

total citations

FWCI: 50.80
Percentile: 100%
References: 29

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Adversarial system
Artificial intelligence
Word (group theory)
Natural language processing
Correctness
Robustness (evolution)
Similarity (geometry)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.