Abstract
Clustering, a well-known unsupervised machine learning technique is effective in handling massive amount of data for a variety of applications. Clustering is the process of grou** identical data into clusters that belong to a specific category. Big data consists of massive amounts of data generated every second; hence, clustering this data is quite challenging. Deep learning techniques are used to analyze large amounts of data, which necessitates the use of a large number of samples for training, resulting in a time-consuming and inefficient process. The adhoc fuzzy C means method can be used to overcome this limitation. In this research study, a fusion approach known as fusion clustering is built by combining auto-encoder features with adhoc fuzzy C means (AFCM) technique to improve the fusion approach. The auto-encoder, in combination with the CNN approach, is used for feature engineering to improve the performance, resulting in increased computational speed. The efficiency of the proposed model is improved by evaluating each performance metrics in detail by using three different data types: MNIST, Fashion-MNIST dataset, and USPS. The comparison of each dataset demonstrates that the proposed adhoc fuzzy C means (AFCM) model outperforms other state-of-the-art techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bezdek J (1981) Pattern recognition with objective function algorithms. Plenum, New York, NY, USA
Available. https://doi.org/10.1007/s10994-009-5103-0
Agrawal D, Bernstein P, Bertino E, Davidson S, Dayal U, Franklin M, Gehrke J, Haas L, Halevy A, Han J, Jagadish HV, Labrinidis A, Madden S, Papakon stantinou Y, Patel J, Ramakrishnan R, Ross K, Shahabi C, Dan S, Vaithyanathan S, Widom J (2011) Challenges and opportunities with big data, cyber center technical reports, Purdue University
Ahn B (2012) Neuron machine: Parallel and pipelined digital neurocomputing architecture. In: IEEE international conference on computational intelligence and cybernetics (CyberneticsCom) 2012:143–147
Atluri G, Karpatne A, Kumar V (2018) Spatio-temporal data mining: a survey of problems and methods. ACM Comput Surv. https://doi.org/10.1145/3161602
Deng Y, Ren Z, Kong Y, Bao F, Dai Q (2017) A hierarchical fused fuzzy deep neural network for data classification. IEEE Trans Fuzzy Syst 25(4):1006–1012
Jain AK (2008) “Data clustering: 50 years beyond k-means,” in machine learning and knowledge discovery in databases. Berlin, Germany: Springer, pp 3–4
Yang Y, Ma Z, Yang Y, Nie F, Shen HT (2015) Multitask spectral clustering by exploring intertask correlation. IEEE Trans Cybern 45(5):1069–1080
Havens TC, Bezdek JC, Keller JM, Popescu M, Huband JM (2009) Is VAT single linkage in disguise? Ann Math Artif Intell 55(3–4):237–251
Hore P et al (2009) A scalable framework for segmenting magnetic resonance images. J Signal Process Syst 54(1–3):183–203
Riaz S, Arshad A, Jiao L (2018) Fuzzy rough C-mean based unsupervised CNN clustering for large-scale image data. Appl Sci 8(10):1869
Wu H, Prasad S (2018) ‘Semi-supervised deep learning using pseudo labels for hyperspectral image classification.’ IEEE Trans Image Process 27(3):1259–1270
Rajesh T, Malar RSM (2013) Rough set theory and feed forward neural network based brain tumor detection in magnetic resonance images. In: InInternational Conference on Advanced Nanomaterials & Emerging Engineering Technologies, pp 240–244
https://nlp.stanford.edu/IR-book/html/htmledition/evaluation-of-clustering-1.html
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sreedhara, S.H., Kumar, V., Salma, S. (2023). Efficient Big Data Clustering Using Adhoc Fuzzy C Means and Auto-Encoder CNN. In: Smys, S., Kamel, K.A., Palanisamy, R. (eds) Inventive Computation and Information Technologies. Lecture Notes in Networks and Systems, vol 563. Springer, Singapore. https://doi.org/10.1007/978-981-19-7402-1_25
Download citation
DOI: https://doi.org/10.1007/978-981-19-7402-1_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-7401-4
Online ISBN: 978-981-19-7402-1
eBook Packages: EngineeringEngineering (R0)