Abstract
Pattern mining methods help to extract valuable information from a large dataset. The extraction of knowledge might result in the risk of privacy issues. Some potential information might disclosure the insights about customers’ behaviors. This leads us to the issue of privacy-preserving data mining (PPDM) that hides sensitive information as much as possible but remains valid information for the further knowledge discovery methods. In this paper, we first propose a sanitization approach for hiding the sensitive periodic frequent patterns. The proposed method utilizes the Term Frequency and Inverse Document Frequency (TF-IDF) to select the transactions and items for sanitization based on the user-defined sensitive periodic frequent patterns. The designed approach can select the victim items in the transactional database for data sanitization. Experimental results showed that the model can perform better for sparse and dense datasets under different user-defined thresholds.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
C.C. Aggarwal, J. Pei, B. Zhang, On privacy preservation against adversarial data mining, in ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 510–516 (2006)
U. Ahmed, G. Srivastava, J.C.W. Lin, A machine learning model for data sanitization. Comput. Netw. 189, 107914 (2021)
K. Amphawan, P. Lenca, A. Surarerks, Mining top-k periodic-frequent pattern from transactional databases without support threshold, in Advances in Information Technology, pp. 18–29 (2009)
K. Amphawan, A. Surarerks, P. Lenca, P, Mining periodic-frequent itemsets with approximate periodicity using interval transaction-ids list tree, in The International Conference on Knowledge Discovery and Data Mining, pp. 245–248 (2010)
C. Clifton, M. Kantarcioglu, J. Vaidya, X. Lin, M.Y. Zhu, Tools for privacy preserving distributed data mining. ACM SIGKDD Explor. Newsl. 4(2), 28–34 (2002)
M.N. Dehkordi, K. Badie, A.K. Zadeh, A novel method for privacy preserving in association rule mining based on genetic algorithms. J. Soft. 4(6), 555–562 (2009)
C. Dwork, F. McSherry, K. Nissim, A. Smith, Calibrating noise to sensitivity in private data analysis, in Theory of Cryptography Conference, pp. 265–284 (2006)
A. Evfimievski, R. Srikant, R. Agrawal, J. Gehrke, Privacy preserving mining of association rules. Inf. Syst. 29(4), 343–364 (2004)
P. Fournier-Viger, J.W. Lin, B. Vo, T. Truong, J. Zhang, H. Le, A survey of itemset mining. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 7(4), e1207 (2017)
P. Fournier-Viger, J.C.W. Lin, Q.H. Dong, D.T. Lan, Phm: mining periodic high-utility itemsets, in Industrial Conference on Data Mining, pp. 64–79 (2016)
P. Fournier-Viger, C.W. Lin, Q.H. Duong, T.L. Dam, L. Ĺ evÄŤĂk, D. Uhrin, M. Voznak, Pfpm: discovering periodic frequent patterns with novel periodicity measures, in The Czech-China Scientific Conference (2017)
P. Fournier-Viger, J.C.W. Lin, A. Gomariz, T. Gueniche, A. Soltani, Z. Deng, H.T. Lam, The spmf open-source data mining library version 2, pp. 36–40 (2016)
P. Fournier-Viger, Y. Wang, P. Yang, J.C.W. Lin, U. Yun, R.U. Kiran, Tspin: mining top-k stable periodic patterns. Appl. Intell., 1–22 (2021)
P. Fournier-Viger, P. Yang, R.U. Kiran, S. Ventura, J.M. Luna, Mining local periodic patterns in a discrete sequence. Inf. Sci. 544, 519–548 (2021)
P. Fournier-Viger, P. Yang, Z. Li, J.C.W. Lin, R.U. Kiran, Discovering rare correlated periodic patterns in multiple sequences. Data Knowl. Eng. 126, 101733 (2020)
P. Fournier-Viger, P. Yang, J.C.W. Lin, R.U. Kiran, Discovering stable periodic-frequent patterns in transactional data, in The International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, pp. 230–244 (2019)
J. Han, J. Pei, Y. Yin, Mining frequent patterns without candidate generation, in ACM SIDMOD International Conference on Management of Data, pp. 1–12 (2000)
T.P. Hong, C.W.Y. Lin, K. Tung, S.L. Wang, Using TF-IDF to hide sensitive itemsets. Appl. Intell. 38(4), 502–510 (2012)
R.U. Kiran, M. Kitsuregawa, P.K. Reddy, Efficient discovery of periodic-frequent patterns in very large databases. J. Syst. Softw. 112, 110–121 (2016)
R.U. Kiran, P.K. Reddy, Mining rare periodic-frequent patterns using multiple minimum supports, in ACM Bangalore Conference, pp. 1–8 (2010)
B.K. Pandya, U.K. Singh, K. Dixit, K. Bunkar, Effectiveness of multiplicative data perturbation for privacy preserving data mining. Int. J. Adv. Res. Comput. Sci. 5(6) (2014)
C.W. Lin, T.P. Hong, C.C. Chang, S.L. Wang, A greedy-based approach for hiding sensitive itemsets by transaction insertion. J. Inf. Hiding Multimed. Signal Process. 4(4), 201–227 (2013)
C.W. Lin, T.P. Hong, H.C. Hsu, Reducing side effects of hiding sensitive itemsets in privacy preserving data mining. Sci. World J. 2014 (2014)
C.W. Lin, T.P. Hong, K.T. Yang, S.L. Wang, The ga-based algorithms for optimizing hiding sensitive itemsets through transaction deletion. Appl. Intell. 42(2), 210–230 (2015)
C.W. Lin, Y. Zhang, B. Zhang, P. Fournier-Viger, Y. Djenouri, Hiding sensitive itemsets with multiple objective optimization. Soft. Comput. 23(4), 12779–12797 (2019)
J.C.W. Lin, Q. Liu, P. Fournier Viger, T.P. Hong, M. Voznak, J. Zhan, A sanitization approach for hiding sensitive itemsets based on particle swarm optimization. Eng. Appl. Artif. Intell. 53, 1–18 (2016)
J.C.W. Lin, G. Srivastava, Y. Zhang, Y. Djenouri, M. Aloqaily, Privacy-preserving multiobjective sanitization model in 6g iot environments. IEEE Internet Things J. 8(7), 5340–5349 (2021)
J.C.W. Lin, T.Y. Wu, P. Fournier-Viger, G. Lin, J. Zhan, M. Voznak, Fast algorithms for hiding sensitive high-utility itemsets in privacy-preserving utility mining. Eng. Appl. Artif. Intell. 55, 269–284 (2016)
Y. Lindell, B. Pinkas, Privacy preserving data mining, in Annual International Cryptology Conference, pp. 36–54 (2000)
P. Fournier-Viger, Z. Li, J.C.W. Lin, R.U. Kiran, H. Fujita, Efficient algorithms to identify periodic patterns in multiple sequences. Inform. Sci. 489, 205–226 (2019)
G. Salton, E.A. Fox, H. Wu, Extended boolean information retrieval. Commun. ACM 26, 1022–1036 (1983)
S.K. Tanbeer, C.F. Ahmed, B. Jeong, Y.K. Lee, Discovering periodic-frequent patterns in transactional databases, in Advances in Knowledge Discovery and Data Mining, pp. 242–253 (2009)
V.S. Verykios, E. Bertino, I.N. Fovino, L.P. Provenza, Y. Saygin, Y. Theodoridis, State-of-the-art in privacy preserving data mining. ACM SIGMOD Record 33(1), 50–57 (2004)
T.Y. Wu, J.C.W. Lin, Y. Zhang, C.H. Chen, The grid-based swarm intelligence algorithm for privacy-preserving data mining. Appl. Sci. 9(4), 774 (2019)
M.J. Zaki, Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372–389 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Ahmed, U., Lin, J.CW., Fournier-Viger, P. (2021). Privacy Preservation of Periodic Frequent Patterns Using Sensitive Inverse Frequency. In: Kiran, R.U., Fournier-Viger, P., Luna, J.M., Lin, J.CW., Mondal, A. (eds) Periodic Pattern Mining . Springer, Singapore. https://doi.org/10.1007/978-981-16-3964-7_12
Download citation
DOI: https://doi.org/10.1007/978-981-16-3964-7_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3963-0
Online ISBN: 978-981-16-3964-7
eBook Packages: Computer ScienceComputer Science (R0)