articleApr 19, 2005Closed access

Data Privacy through Optimal k-Anonymization

IBM Research - Almaden

Indexed incrossref

Abstract

Data de-identification reconciles the demand for release of data for research purposes and the demand for privacy from individuals. This paper proposes and evaluates an optimization algorithm for the powerful de-identification procedure known as k-anonymization. A k-anonymized dataset has the property that each record is indistinguishable from at least k - 1 others. Even simple restrictions of optimized k-anonymity are NP-hard, leading to significant computational challenges. We present a new approach to exploring the space of possible anonymizations that tames the combinatorics of the problem, and develop data-management strategies to reduce reliance on expensive operations such as sorting. Through…

Citation impact

1,128
total citations
FWCI
93.71
Percentile
100%
References
19
Citations per year

Authors

2

Topics & keywords

Keywords
  • k-anonymity
  • Computer science
  • Data anonymization
  • Anonymity
  • Identification (biology)
  • Data mining
  • Coding (social sciences)
  • Sorting
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.