Efficient Big Data Clustering Using Adhoc Fuzzy C Means and Auto-Encoder CNN

  • Conference paper
  • First Online:
Inventive Computation and Information Technologies

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 563))

Abstract

Clustering, a well-known unsupervised machine learning technique is effective in handling massive amount of data for a variety of applications. Clustering is the process of grou** identical data into clusters that belong to a specific category. Big data consists of massive amounts of data generated every second; hence, clustering this data is quite challenging. Deep learning techniques are used to analyze large amounts of data, which necessitates the use of a large number of samples for training, resulting in a time-consuming and inefficient process. The adhoc fuzzy C means method can be used to overcome this limitation. In this research study, a fusion approach known as fusion clustering is built by combining auto-encoder features with adhoc fuzzy C means (AFCM) technique to improve the fusion approach. The auto-encoder, in combination with the CNN approach, is used for feature engineering to improve the performance, resulting in increased computational speed. The efficiency of the proposed model is improved by evaluating each performance metrics in detail by using three different data types: MNIST, Fashion-MNIST dataset, and USPS. The comparison of each dataset demonstrates that the proposed adhoc fuzzy C means (AFCM) model outperforms other state-of-the-art techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bezdek J (1981) Pattern recognition with objective function algorithms. Plenum, New York, NY, USA

    Book  MATH  Google Scholar 

  2. Available. https://doi.org/10.1007/s10994-009-5103-0

  3. Agrawal D, Bernstein P, Bertino E, Davidson S, Dayal U, Franklin M, Gehrke J, Haas L, Halevy A, Han J, Jagadish HV, Labrinidis A, Madden S, Papakon stantinou Y, Patel J, Ramakrishnan R, Ross K, Shahabi C, Dan S, Vaithyanathan S, Widom J (2011) Challenges and opportunities with big data, cyber center technical reports, Purdue University

    Google Scholar 

  4. Ahn B (2012) Neuron machine: Parallel and pipelined digital neurocomputing architecture. In: IEEE international conference on computational intelligence and cybernetics (CyberneticsCom) 2012:143–147

    Google Scholar 

  5. Atluri G, Karpatne A, Kumar V (2018) Spatio-temporal data mining: a survey of problems and methods. ACM Comput Surv. https://doi.org/10.1145/3161602

    Article  Google Scholar 

  6. Deng Y, Ren Z, Kong Y, Bao F, Dai Q (2017) A hierarchical fused fuzzy deep neural network for data classification. IEEE Trans Fuzzy Syst 25(4):1006–1012

    Article  Google Scholar 

  7. Jain AK (2008) “Data clustering: 50 years beyond k-means,” in machine learning and knowledge discovery in databases. Berlin, Germany: Springer, pp 3–4

    Google Scholar 

  8. Yang Y, Ma Z, Yang Y, Nie F, Shen HT (2015) Multitask spectral clustering by exploring intertask correlation. IEEE Trans Cybern 45(5):1069–1080

    Article  Google Scholar 

  9. Havens TC, Bezdek JC, Keller JM, Popescu M, Huband JM (2009) Is VAT single linkage in disguise? Ann Math Artif Intell 55(3–4):237–251

    Google Scholar 

  10. Hore P et al (2009) A scalable framework for segmenting magnetic resonance images. J Signal Process Syst 54(1–3):183–203

    Article  Google Scholar 

  11. Riaz S, Arshad A, Jiao L (2018) Fuzzy rough C-mean based unsupervised CNN clustering for large-scale image data. Appl Sci 8(10):1869

    Article  Google Scholar 

  12. Wu H, Prasad S (2018) ‘Semi-supervised deep learning using pseudo labels for hyperspectral image classification.’ IEEE Trans Image Process 27(3):1259–1270

    Article  MathSciNet  MATH  Google Scholar 

  13. Rajesh T, Malar RSM (2013) Rough set theory and feed forward neural network based brain tumor detection in magnetic resonance images. In: InInternational Conference on Advanced Nanomaterials & Emerging Engineering Technologies, pp 240–244

    Google Scholar 

  14. https://www.kaggle.com/bistaumanga/usps-dataset

  15. https://nlp.stanford.edu/IR-book/html/htmledition/evaluation-of-clustering-1.html

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Subia Salma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sreedhara, S.H., Kumar, V., Salma, S. (2023). Efficient Big Data Clustering Using Adhoc Fuzzy C Means and Auto-Encoder CNN. In: Smys, S., Kamel, K.A., Palanisamy, R. (eds) Inventive Computation and Information Technologies. Lecture Notes in Networks and Systems, vol 563. Springer, Singapore. https://doi.org/10.1007/978-981-19-7402-1_25

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-7402-1_25

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-7401-4

  • Online ISBN: 978-981-19-7402-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Navigation