Abstract
Big Data (BD) refers to the storage, processing, and analysis of enormous amounts of data. With the rapid growth of cloud computing in healthcare (HC), the security and confidentiality of medical records have become a primary consideration for HC services and applications. Present-day cryptosystems alone are insufficient to address these concerns. Therefore, this paper introduces a novel Three-Factor Authentication (3FA) and optimal Map-Reduce (MR) framework for secure BD transmission over the cloud with Secure Hashing Authentication XOR-ed Elliptical Curve Cryptography (SHAXECC). Authentication is first carried out with the SHA-512 algorithm, which protects the network from unauthorized access. Next, data deduplication is performed using the SHA-512 algorithm to eliminate duplicate files. After that, an optimal MR design is introduced to handle the large volume of BD. In the optimal MR, the mapper uses the Modified Fuzzy C-means (MFCM) clustering approach to form the initial BD clusters. Then, the reducer uses the Levy Flight and Scoring Mutation-based Chimp Optimization Algorithm (LSCOA) to form the final BD clusters. Finally, SHAXECC is used to transmit the data securely. Experiments compare the proposed technique with existing techniques on several performance measures. The proposed approach outperformed the existing models on both clustering and security measures, which indicates that it is well suited to data protection and privacy for cloud-enabled HC data.
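The deduplication step described above can be illustrated with a minimal sketch, assuming that two files are treated as duplicates exactly when their SHA-512 digests match. The function names (`sha512_digest`, `deduplicate`) and the sample records are hypothetical and chosen only for illustration; they are not taken from the paper's implementation or from its dataset.

```python
import hashlib
from typing import Dict, Tuple


def sha512_digest(data: bytes) -> str:
    """Return the SHA-512 digest of the given bytes as a hex string."""
    return hashlib.sha512(data).hexdigest()


def deduplicate(files: Dict[str, bytes]) -> Tuple[Dict[str, bytes], Dict[str, str]]:
    """Keep the first file seen for each distinct SHA-512 digest.

    Returns (unique, duplicates):
      unique     maps each kept file name to its content
      duplicates maps each dropped file name to the kept file it duplicates
    """
    kept_name_by_digest: Dict[str, str] = {}
    unique: Dict[str, bytes] = {}
    duplicates: Dict[str, str] = {}
    for name, content in files.items():
        digest = sha512_digest(content)
        if digest in kept_name_by_digest:
            # Same digest as an earlier file: record it as a duplicate.
            duplicates[name] = kept_name_by_digest[digest]
        else:
            kept_name_by_digest[digest] = name
            unique[name] = content
    return unique, duplicates


# Hypothetical sample records (made up for this sketch).
records = {
    "patient_a.csv": b"age,chol\n63,233\n",
    "patient_b.csv": b"age,chol\n41,204\n",
    "patient_a_copy.csv": b"age,chol\n63,233\n",
}
unique, duplicates = deduplicate(records)
print(sorted(unique))
print(duplicates)
```

Comparing fixed-length digests rather than full file contents keeps the duplicate check cheap for large files; how the paper's framework stores or indexes the digests is not stated in the abstract, so the in-memory dictionary here is only an assumption.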
Data availability
The dataset used in this work is publicly available at https://www.kaggle.com/datasets/johnsmith88/heart-disease-dataset/code.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
About this article
Cite this article
Rajeshkumar, K., Dhanasekaran, S. & Vasudevan, V. A novel three-factor authentication and optimal mapreduce frameworks for secure medical big data transmission over the cloud with shaxecc. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-18147-6
DOI: https://doi.org/10.1007/s11042-024-18147-6