Data Privacy through Optimal k-Anonymization

Bayardo, Roberto J.; Agrawal, R.

doi:10.1109/icde.2005.42

articleApr 19, 2005Closed access

Data Privacy through Optimal k-Anonymization

RJRoberto J. Bayardo RAR. Agrawal

IBM Research - Almaden

Indexed incrossref

Abstract

Data de-identification reconciles the demand for release of data for research purposes and the demand for privacy from individuals. This paper proposes and evaluates an optimization algorithm for the powerful de-identification procedure known as k-anonymization. A k-anonymized dataset has the property that each record is indistinguishable from at least k - 1 others. Even simple restrictions of optimized k-anonymity are NP-hard, leading to significant computational challenges. We present a new approach to exploring the space of possible anonymizations that tames the combinatorics of the problem, and develop data-management strategies to reduce reliance on expensive operations such as sorting. Through…

Citation impact

1,128

total citations

FWCI: 93.71
Percentile: 100%
References: 19

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

k-anonymity
Computer science
Data anonymization
Anonymity
Identification (biology)
Data mining
Coding (social sciences)
Sorting

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.