Evaluating the Privacy Implications of Frequent Itemset Disclosure
Document Type
Conference Proceeding
Publication Date
2017
DOI
https://doi.org/10.1007/978-3-319-58469-0_34
Abstract
Frequent itemset mining is a fundamental data analytics task. In many cases, due to privacy concerns, only the frequent itemsets are released instead of the underlying data. However, it is not clear how to evaluate the privacy implications of the disclosure of the frequent itemsets. Towards this, in this paper, we define the k-distant-IFM-solutions problem, which aims to find k transaction datasets whose pair distance is maximized. The degree of difference between the reconstructed datasets provides a way to evaluate the privacy risk. Since the problem is NP-hard, we propose a 2-approximate solution as well as faster heuristics, and evaluate them on real data.
Publication Information
Serra, Edoardo; Vaidya, Jaideep; Akella, Haritha; and Sharma, Ashish. (2017). "Evaluating the Privacy Implications of Frequent Itemset Disclosure". IFIP International Conference on ICT Systems Security and Privacy Protection, 506-519.