On the complexity of optimal K-anonymity
University of California, Los Angeles · Carnegie Mellon University
Abstract
The technique of k-anonymization has been proposed in the literature as an alternative way to release public information, while ensuring both data privacy and data integrity. We prove that two general versions of optimal k-anonymization of relations are NP-hard, including the suppression version which amounts to choosing a minimum number of entries to delete from the relation. We also present a polynomial time algorithm for optimal k-anonymity that achieves an approximation ratio independent of the size of the database, when k is constant. In particular, it is a O(k log k)-approximation where the constant in the big-O is no more than 4, However, the runtime of the algorithm is exponential in k. A slightly more…
Citation impact
- FWCI
- 46.25
- Percentile
- 100%
- References
- 11
Authors
2Topics & keywords
- k-anonymity
- Relation (database)
- Constant (computer programming)
- Anonymity
- Approximation algorithm
- Computer science
- Time complexity
- Polynomial
- Peace, Justice and strong institutions