Abstract
Sentiment analysis is the most effective way to understand opinions presented in digital media. Social sense is the concept of understanding every user's emotions connected through an online social media platform. This emotion helps to understand the mental health of people. Every stage of extracting social impact contains several problems with several methods associated with it, like lexicon-based techniques, which were proposed, developed, and evaluated but sometimes had poor accuracy. Machine Learning (ML) is useful to recognize patterns, in various aspects of sentiment analysis and it will best suited for this traditional algorithm where the large dataset is chunked into the smaller dataset to train the model. NLP methods do not account for different domains when modeling sentiment information. In a classification dilemma, reducing the feature set size diminishes the algorithm's time demand while improving the method's accuracy to obtain the optimal features. Scalability refers to handling large amounts of data and performing several computations like time and cost efficiently. This paper's perspective is to overcome the shortcomings of each system, and this article proposes a hybrid approach to sentiment analysis. Data has been collected from Twitter via the Twitter API. This research investigates the effects of negations, URLs, usernames, punctuation, repeated character normalization, and hashtags on sentiment classification output using an n-gram representation model, which is a kind of probabilistic language model and also integrates a cross-domain sentiment-aware learning model that can collect both sentiment awareness and domain validity of a word simultaneously to solve the problem of words from various domains. To fix the scalability problem, the article offers the novel Tunicate Swarm Algorithm (TSA), which increases scalability and reduces computation time. Meanwhile, this paper custom a hybrid Harris Hawks Optimization (HHO) algorithm based on simulated annealing (SA) and bitwise operations to solve the local optima problem. This proposed work results in the sentiment score that people with positive sentiments are more than those with negative. The proposed model gives better feature size reduction measured using the correlation coefficient metric and comparing other state-of-the-art algorithms like the wrapper approach, particle swarm optimization, and greedy feature selection with a reduction in feature size of up to 64%. The precision–recall curve ranges toward one, which means the classifier does classification accurately. In the sentiment analysis of Twitter feedback, the suggested solution has greater exactness and scalability with 96 s. The proposed model is well-trained, and the best validation is achieved at the 4th epoch with 0.6672. The area under the ROC curve evaluates the separability near one that shows the proposed model works excellently. The claim of a proposed model has been validated through comparison from different classifiers like Naïve Bayes, kNN, Random Forest, CNN-RNN, and AC-BiLSTM with admissible accuracy of 96.37%.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig10_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig11_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig12_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig13_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs42979-023-02017-3/MediaObjects/42979_2023_2017_Fig14_HTML.png)
Similar content being viewed by others
Data availability
The data will be available upon reasonable request from authors.
References
Uban AS, Chulvi B, Rosso P. An emotion and cognitive based analysis of mental health disorders from social media data. Future Gener Comput Syst. 2021. https://doi.org/10.1016/j.future.2021.05.032.
Iftekhar EN, Priesemann V, Balling R, Bauer S, Beutels P, Valdez AC, Willeit P. A look into the future of the COVID-19 pandemic in Europe: an expert consultation. Lancet Reg Health (Europe). 2021. https://doi.org/10.1016/j.lanepe.2021.100185.
Yin H, Song X, Yang S, Li J. Sentiment analysis and topic modeling for COVID-19 vaccine discussions. World Wide Web. 2022. https://doi.org/10.1007/s11280-022-01029-y.
Krueger T, Gogolewski K, Bodych M, Gambin A, Giordano G, Cuschieri S, Szczurek E. Risk assessment of COVID-19 epidemic resurgence in relation to SARS-CoV-2 variants and vaccination passes. Commun Med. 2022. https://doi.org/10.1038/s43856-022-00084-w.
Maatuk AM, Elberkawi EK, Aljawarneh S, Rashaideh H, Alharbi H. The COVID-19 pandemic and E-learning: challenges and opportunities from the perspective of students and instructors. J Comput High Educ. 2022. https://doi.org/10.1007/s12528-021-09274-2.
Linardon J, Messer M, Rodgers RF, Fuller-Tyszkiewicz M. A systematic sco** review of research on COVID-19 impacts on eating disorders: a critical appraisal of the evidence and recommendations for the field. Int J Eat Disorders. 2022. https://doi.org/10.1002/eat.23640.
Ntompras C, Drosatos G, Kaldoudi E. A high-resolution temporal and geospatial content analysis of Twitter posts related to the COVID-19 pandemic. J Comput Soc Sci. 2022. https://doi.org/10.1007/s42001-021-00150-8.
Misiejuk K, Scianna J, Kaliisa R, Vachuska K, Shaffer DW. Incorporating sentiment analysis with epistemic network analysis to enhance discourse analysis of twitter data. Commun Comput Inf Sci. 2021. https://doi.org/10.1007/978-3-030-67788-6_26.
Mahajan R, Mansotra V. Correlating crime and social media: using semantic sentiment analysis. Int J Adv Comput Sci Appl. 2021. https://doi.org/10.14569/IJACSA.2021.0120338.
Kokol P, Kokol M, Zagoranski S. Machine learning on small size samples: a synthetic knowledge synthesis. Sci Progr. 2022. https://doi.org/10.1177/00368504211029777.
Chang YC, Ku CH, Le Nguyen DD. Predicting aspect-based sentiment using deep learning and information visualization: the impact of COVID-19 on the airline industry. Inf Manag. 2022. https://doi.org/10.1016/j.im.2021.103587.
Sharaff A, Jain M, Modugula G. Feature based cluster ranking approach for single document summarization. Int J Inf Technol. 2022. https://doi.org/10.1007/s41870-021-00853-1.
Dubey AD. Public sentiment analysis of COVID-19 vaccination drive in India. SSRN Electron J. 2021. https://doi.org/10.2139/ssrn.3772401.
Chouhan RL. Sentiment analysis of Pulwama attack using twitter data. Lect Notes Netw Syst. 2021. https://doi.org/10.1007/978-981-15-5421-6_13.
Naseem U, Razzak I, Khushi M, Eklund PW, Kim J. COVID senti: a large-scale benchmark twitter data set for COVID-19 sentiment analysis. IEEE Trans Comput Soc Syst. 2021. https://doi.org/10.1109/TCSS.2021.3051189.
Hajiali M. Big data and sentiment analysis: a comprehensive and systematic literature review. Concurr Comput. 2020. https://doi.org/10.1002/cpe.5671.
Mathur A, Kubde P, Vaidya S. Emotional analysis using twitter data during pandemic situation: COVID-19. Int Conf Commun Electron Syst (ICCES). 2020. https://doi.org/10.1109/icces48766.2020.9138079.
Salur MU, Aydin I. A novel hybrid deep learning model for sentiment classification. IEEE Access. 2020. https://doi.org/10.1109/ACCESS.2020.2982538.
Soomro ZT, Ilyas SHW, Yaqub U. Sentiment, count and cases: analysis of twitter discussions during COVID-19 pandemic. Int Conf Behav Soc Comput (BESC). 2020. https://doi.org/10.1109/BESC51023.2020.9348291.
Sharaff A, Khaire AS, Sharma D. Analysing fuzzy based approach for extractive text summarization. Int Conf Intell Comput Control Syst (ICCS). 2019. https://doi.org/10.1109/ICCS45141.2019.9065722.
Mengistie TT, Kumar D. Deep learning based sentiment analysis on COVID-19 public reviews. Int Conf Artif Intell Inf Commun (ICAIIC). 2021. https://doi.org/10.1109/ICAIIC51459.2021.9415191.
Khan R, Rustam F, Kanwal K, Mehmood A, Choi GS. US Based COVID-19 tweets sentiment analysis using TextBlob and supervised machine learning algorithms. Int Conf Artif Intell (ICAI). 2021. https://doi.org/10.1109/ICAI52203.2021.9445207.
Gulati K, Kumar SS, Boddu RSK, Sarvakar K, Sharma DK, Nomani MZM. Comparative analysis of machine learning-based classification models using sentiment classification of tweets related to COVID-19 pandemic. Mater Today. 2021. https://doi.org/10.1016/j.matpr.2021.04.364.
Sitaula C, Basnet A, Mainali A, Shahi TB. Deep learning-based methods for sentiment analysis on Nepali COVID-19-related tweets. Comput Intell Neurosci. 2021. https://doi.org/10.1155/2021/2158184.
Chakraborty K, Bhatia S, Bhattacharyya S, Platos J, Bag R, Hassanien AE. Sentiment analysis of COVID-19 tweets by deep learning classifiers—a study to show how popularity is affecting accuracy in social media. Appl Soft Comput. 2020. https://doi.org/10.1016/j.asoc.2020.106754.
Han H, Zhang Y, Zhang J, Yang J, Zou X. Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias. PLoS ONE. 2018. https://doi.org/10.1371/journal.pone.0202523.
L’heureux A, Grolinger K, Elyamany HF, Capretz MA. Machine learning with big data: challenges and approaches. IEEE Access. 2017. https://doi.org/10.1109/ACCESS.2017.2696365.
Kaur S, Awasthi LK, Sangal AL, Dhiman G. Tunicate Swarm Algorithm: a new bio-inspired based metaheuristic paradigm for global optimization. Eng Appl Artif Intell. 2020. https://doi.org/10.1016/j.engappai.2020.103541.
Sharifi MR, Akbarifard S, Qaderi K, Madadi MR. Comparative analysis of some evolutionary-based models in optimization of dam reservoirs operation. Sci Rep. 2021. https://doi.org/10.1038/s41598-021-95159-4.
Abdel-Basset M, Ding W, El-Shahat D. A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection. Artif Intell Rev. 2021. https://doi.org/10.1007/s10462-020-09860-3.
Nagwani NK, Sharaff A. SMS spam filtering and thread identification using bi-level text classification and clustering techniques. J Inf Sci. 2017. https://doi.org/10.1177/0165551515616310.
El-Hasnony IM, Elhoseny M, Tarek Z. A hybrid feature selection model based on butterfly optimization algorithm: COVID-19 as a case study. Expert Syst. 2022. https://doi.org/10.1111/exsy.12786.
Lochter JV, Silva RM, Almeida TA. Deep learning models for representing out-of-vocabulary words. Lect Notes Comput Sci. 2020. https://doi.org/10.1007/978-3-030-61377-8_29.
Acknowledgements
The author is grateful to the National Institute of Technology Raipur, India to provide every essential infrastructure and support to carry out this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Enabling Innovative Computational Intelligence Technologies for IOT” guest edited by Omer Rana, Rajiv Misra, Alexander Pfeiffer, Luigi Troiano and Nishtha Kesswani.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Seth, R., Sharaff, A. Sentiment Data Analysis for Detecting Social Sense after COVID-19 using Hybrid Optimization Method. SN COMPUT. SCI. 4, 568 (2023). https://doi.org/10.1007/s42979-023-02017-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-023-02017-3