Membership Inference Attacks From First Principles

Carlini, Nicholas; Chien, Steve; Nasr, Milad; Song, Shuang; Terzis, Andreas; Tramèr, Florian

doi:10.1109/sp46214.2022.9833649

article2022 IEEE Symposium on Security and Privacy (SP)May 1, 2022Closed access

Membership Inference Attacks From First Principles

NCNicholas Carlini SCSteve Chien MNMilad Nasr SSShuang Song ATAndreas Terzis

Google (United States) · University of Massachusetts Amherst

Indexed incrossref

Abstract

A membership inference attack allows an adversary to query a trained machine learning model to predict whether or not a particular example was contained in the model’s training dataset. These attacks are currently evaluated using average-case “accuracy” metrics that fail to characterize whether the attack can confidently identify any members of the training set. We argue that attacks should instead be evaluated by computing their true-positive rate at low (e.g., ≤ 0.1%) false-positive rates, and find most prior attacks perform poorly when evaluated in this way. To address this we develop a Likelihood Ratio Attack (LiRA) that carefully combines multiple ideas from the literature. Our attack is $10\times$ more…

Citation impact

386

total citations

FWCI: 35.93
Percentile: 100%
References: 98

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Computer science
Inference
Computer security
Artificial intelligence

No related works found for this paper.