Log in

Sentiment Data Analysis for Detecting Social Sense after COVID-19 using Hybrid Optimization Method

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Sentiment analysis is the most effective way to understand opinions presented in digital media. Social sense is the concept of understanding every user's emotions connected through an online social media platform. This emotion helps to understand the mental health of people. Every stage of extracting social impact contains several problems with several methods associated with it, like lexicon-based techniques, which were proposed, developed, and evaluated but sometimes had poor accuracy. Machine Learning (ML) is useful to recognize patterns, in various aspects of sentiment analysis and it will best suited for this traditional algorithm where the large dataset is chunked into the smaller dataset to train the model. NLP methods do not account for different domains when modeling sentiment information. In a classification dilemma, reducing the feature set size diminishes the algorithm's time demand while improving the method's accuracy to obtain the optimal features. Scalability refers to handling large amounts of data and performing several computations like time and cost efficiently. This paper's perspective is to overcome the shortcomings of each system, and this article proposes a hybrid approach to sentiment analysis. Data has been collected from Twitter via the Twitter API. This research investigates the effects of negations, URLs, usernames, punctuation, repeated character normalization, and hashtags on sentiment classification output using an n-gram representation model, which is a kind of probabilistic language model and also integrates a cross-domain sentiment-aware learning model that can collect both sentiment awareness and domain validity of a word simultaneously to solve the problem of words from various domains. To fix the scalability problem, the article offers the novel Tunicate Swarm Algorithm (TSA), which increases scalability and reduces computation time. Meanwhile, this paper custom a hybrid Harris Hawks Optimization (HHO) algorithm based on simulated annealing (SA) and bitwise operations to solve the local optima problem. This proposed work results in the sentiment score that people with positive sentiments are more than those with negative. The proposed model gives better feature size reduction measured using the correlation coefficient metric and comparing other state-of-the-art algorithms like the wrapper approach, particle swarm optimization, and greedy feature selection with a reduction in feature size of up to 64%. The precision–recall curve ranges toward one, which means the classifier does classification accurately. In the sentiment analysis of Twitter feedback, the suggested solution has greater exactness and scalability with 96 s. The proposed model is well-trained, and the best validation is achieved at the 4th epoch with 0.6672. The area under the ROC curve evaluates the separability near one that shows the proposed model works excellently. The claim of a proposed model has been validated through comparison from different classifiers like Naïve Bayes, kNN, Random Forest, CNN-RNN, and AC-BiLSTM with admissible accuracy of 96.37%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Data availability

The data will be available upon reasonable request from authors.

References

  1. Uban AS, Chulvi B, Rosso P. An emotion and cognitive based analysis of mental health disorders from social media data. Future Gener Comput Syst. 2021. https://doi.org/10.1016/j.future.2021.05.032.

    Article  Google Scholar 

  2. Iftekhar EN, Priesemann V, Balling R, Bauer S, Beutels P, Valdez AC, Willeit P. A look into the future of the COVID-19 pandemic in Europe: an expert consultation. Lancet Reg Health (Europe). 2021. https://doi.org/10.1016/j.lanepe.2021.100185.

    Article  Google Scholar 

  3. Yin H, Song X, Yang S, Li J. Sentiment analysis and topic modeling for COVID-19 vaccine discussions. World Wide Web. 2022. https://doi.org/10.1007/s11280-022-01029-y.

    Article  Google Scholar 

  4. Krueger T, Gogolewski K, Bodych M, Gambin A, Giordano G, Cuschieri S, Szczurek E. Risk assessment of COVID-19 epidemic resurgence in relation to SARS-CoV-2 variants and vaccination passes. Commun Med. 2022. https://doi.org/10.1038/s43856-022-00084-w.

    Article  Google Scholar 

  5. Maatuk AM, Elberkawi EK, Aljawarneh S, Rashaideh H, Alharbi H. The COVID-19 pandemic and E-learning: challenges and opportunities from the perspective of students and instructors. J Comput High Educ. 2022. https://doi.org/10.1007/s12528-021-09274-2.

    Article  Google Scholar 

  6. Linardon J, Messer M, Rodgers RF, Fuller-Tyszkiewicz M. A systematic sco** review of research on COVID-19 impacts on eating disorders: a critical appraisal of the evidence and recommendations for the field. Int J Eat Disorders. 2022. https://doi.org/10.1002/eat.23640.

    Article  Google Scholar 

  7. Ntompras C, Drosatos G, Kaldoudi E. A high-resolution temporal and geospatial content analysis of Twitter posts related to the COVID-19 pandemic. J Comput Soc Sci. 2022. https://doi.org/10.1007/s42001-021-00150-8.

    Article  Google Scholar 

  8. Misiejuk K, Scianna J, Kaliisa R, Vachuska K, Shaffer DW. Incorporating sentiment analysis with epistemic network analysis to enhance discourse analysis of twitter data. Commun Comput Inf Sci. 2021. https://doi.org/10.1007/978-3-030-67788-6_26.

    Article  Google Scholar 

  9. Mahajan R, Mansotra V. Correlating crime and social media: using semantic sentiment analysis. Int J Adv Comput Sci Appl. 2021. https://doi.org/10.14569/IJACSA.2021.0120338.

    Article  Google Scholar 

  10. Kokol P, Kokol M, Zagoranski S. Machine learning on small size samples: a synthetic knowledge synthesis. Sci Progr. 2022. https://doi.org/10.1177/00368504211029777.

    Article  Google Scholar 

  11. Chang YC, Ku CH, Le Nguyen DD. Predicting aspect-based sentiment using deep learning and information visualization: the impact of COVID-19 on the airline industry. Inf Manag. 2022. https://doi.org/10.1016/j.im.2021.103587.

    Article  Google Scholar 

  12. Sharaff A, Jain M, Modugula G. Feature based cluster ranking approach for single document summarization. Int J Inf Technol. 2022. https://doi.org/10.1007/s41870-021-00853-1.

    Article  Google Scholar 

  13. Dubey AD. Public sentiment analysis of COVID-19 vaccination drive in India. SSRN Electron J. 2021. https://doi.org/10.2139/ssrn.3772401.

    Article  Google Scholar 

  14. Chouhan RL. Sentiment analysis of Pulwama attack using twitter data. Lect Notes Netw Syst. 2021. https://doi.org/10.1007/978-981-15-5421-6_13.

    Article  Google Scholar 

  15. Naseem U, Razzak I, Khushi M, Eklund PW, Kim J. COVID senti: a large-scale benchmark twitter data set for COVID-19 sentiment analysis. IEEE Trans Comput Soc Syst. 2021. https://doi.org/10.1109/TCSS.2021.3051189.

    Article  Google Scholar 

  16. Hajiali M. Big data and sentiment analysis: a comprehensive and systematic literature review. Concurr Comput. 2020. https://doi.org/10.1002/cpe.5671.

    Article  Google Scholar 

  17. Mathur A, Kubde P, Vaidya S. Emotional analysis using twitter data during pandemic situation: COVID-19. Int Conf Commun Electron Syst (ICCES). 2020. https://doi.org/10.1109/icces48766.2020.9138079.

    Article  Google Scholar 

  18. Salur MU, Aydin I. A novel hybrid deep learning model for sentiment classification. IEEE Access. 2020. https://doi.org/10.1109/ACCESS.2020.2982538.

    Article  Google Scholar 

  19. Soomro ZT, Ilyas SHW, Yaqub U. Sentiment, count and cases: analysis of twitter discussions during COVID-19 pandemic. Int Conf Behav Soc Comput (BESC). 2020. https://doi.org/10.1109/BESC51023.2020.9348291.

    Article  Google Scholar 

  20. Sharaff A, Khaire AS, Sharma D. Analysing fuzzy based approach for extractive text summarization. Int Conf Intell Comput Control Syst (ICCS). 2019. https://doi.org/10.1109/ICCS45141.2019.9065722.

    Article  Google Scholar 

  21. Mengistie TT, Kumar D. Deep learning based sentiment analysis on COVID-19 public reviews. Int Conf Artif Intell Inf Commun (ICAIIC). 2021. https://doi.org/10.1109/ICAIIC51459.2021.9415191.

    Article  Google Scholar 

  22. Khan R, Rustam F, Kanwal K, Mehmood A, Choi GS. US Based COVID-19 tweets sentiment analysis using TextBlob and supervised machine learning algorithms. Int Conf Artif Intell (ICAI). 2021. https://doi.org/10.1109/ICAI52203.2021.9445207.

    Article  Google Scholar 

  23. Gulati K, Kumar SS, Boddu RSK, Sarvakar K, Sharma DK, Nomani MZM. Comparative analysis of machine learning-based classification models using sentiment classification of tweets related to COVID-19 pandemic. Mater Today. 2021. https://doi.org/10.1016/j.matpr.2021.04.364.

    Article  Google Scholar 

  24. Sitaula C, Basnet A, Mainali A, Shahi TB. Deep learning-based methods for sentiment analysis on Nepali COVID-19-related tweets. Comput Intell Neurosci. 2021. https://doi.org/10.1155/2021/2158184.

    Article  Google Scholar 

  25. Chakraborty K, Bhatia S, Bhattacharyya S, Platos J, Bag R, Hassanien AE. Sentiment analysis of COVID-19 tweets by deep learning classifiers—a study to show how popularity is affecting accuracy in social media. Appl Soft Comput. 2020. https://doi.org/10.1016/j.asoc.2020.106754.

    Article  Google Scholar 

  26. Han H, Zhang Y, Zhang J, Yang J, Zou X. Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias. PLoS ONE. 2018. https://doi.org/10.1371/journal.pone.0202523.

    Article  Google Scholar 

  27. L’heureux A, Grolinger K, Elyamany HF, Capretz MA. Machine learning with big data: challenges and approaches. IEEE Access. 2017. https://doi.org/10.1109/ACCESS.2017.2696365.

    Article  Google Scholar 

  28. Kaur S, Awasthi LK, Sangal AL, Dhiman G. Tunicate Swarm Algorithm: a new bio-inspired based metaheuristic paradigm for global optimization. Eng Appl Artif Intell. 2020. https://doi.org/10.1016/j.engappai.2020.103541.

    Article  Google Scholar 

  29. Sharifi MR, Akbarifard S, Qaderi K, Madadi MR. Comparative analysis of some evolutionary-based models in optimization of dam reservoirs operation. Sci Rep. 2021. https://doi.org/10.1038/s41598-021-95159-4.

    Article  Google Scholar 

  30. Abdel-Basset M, Ding W, El-Shahat D. A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection. Artif Intell Rev. 2021. https://doi.org/10.1007/s10462-020-09860-3.

    Article  Google Scholar 

  31. Nagwani NK, Sharaff A. SMS spam filtering and thread identification using bi-level text classification and clustering techniques. J Inf Sci. 2017. https://doi.org/10.1177/0165551515616310.

    Article  Google Scholar 

  32. El-Hasnony IM, Elhoseny M, Tarek Z. A hybrid feature selection model based on butterfly optimization algorithm: COVID-19 as a case study. Expert Syst. 2022. https://doi.org/10.1111/exsy.12786.

    Article  Google Scholar 

  33. Lochter JV, Silva RM, Almeida TA. Deep learning models for representing out-of-vocabulary words. Lect Notes Comput Sci. 2020. https://doi.org/10.1007/978-3-030-61377-8_29.

    Article  Google Scholar 

Download references

Acknowledgements

The author is grateful to the National Institute of Technology Raipur, India to provide every essential infrastructure and support to carry out this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aakanksha Sharaff.

Ethics declarations

Conflict of Interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Enabling Innovative Computational Intelligence Technologies for IOT” guest edited by Omer Rana, Rajiv Misra, Alexander Pfeiffer, Luigi Troiano and Nishtha Kesswani.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Seth, R., Sharaff, A. Sentiment Data Analysis for Detecting Social Sense after COVID-19 using Hybrid Optimization Method. SN COMPUT. SCI. 4, 568 (2023). https://doi.org/10.1007/s42979-023-02017-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-02017-3

Keywords

Navigation