Abstract
Thanks to the advent of W3C and the rapid proliferation of social media platforms (Facebook, Instagram, Twitter, etc.), an unprecedented level of knowledge sharing has become possible. Users are producing more n more information and it is being circulating to a large number of people. The produced information may not be correct all the time. Even subject matter experts need to judge an array of variables before evaluating to check the trustworthiness. In this study, we propose to use machine learning (ML) to automate message classification. Our research examines various criteria that can be used to differentiate between genuine materials and counterfeits. These qualities are used for training machine learning algorithms, and we then use datasets from the real world to evaluate how well they work. The result shows that Xgboost model gives the highest accuracy for fake news detection in both Tf-Idf and BOW feature extraction techniques.SVM and Multinomial Naïve Base models are the most underperforming models in Tf-Idf and BOW respectively.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-023-17913-2/MediaObjects/11042_2023_17913_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-023-17913-2/MediaObjects/11042_2023_17913_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-023-17913-2/MediaObjects/11042_2023_17913_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-023-17913-2/MediaObjects/11042_2023_17913_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-023-17913-2/MediaObjects/11042_2023_17913_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-023-17913-2/MediaObjects/11042_2023_17913_Fig6_HTML.png)
Similar content being viewed by others
References
Douglas A (2006) News consumption and the new electronic media. 7e Intl J Press/Politics 11(1):29–52
Wong J (2016) Almost all the traffic to fake news sites is from facebook, new data show
Lazer DMJ, Baum MA, Benkler Y et al (2018) The science of fake news. Science 359(6380):1094–1096
S. A. Garc´ıa, G. G. Garc´ıa, M. S. Prieto, A. J. M. Guerrero, and C. R. Jim´enez, “)e impact of term fake news on the scientific community scientific performance and map** in web of science,” Social Sciences, 9(5) 2020
A. Douglas, “News consumption and the new electronic media,” 7e International Journal of Press/Politics, vol. 11, no. 1, pp. 29–52, 2006.
Granik M, Mesyura V (2017) Fake news detection using naive Bayes classifier,” 2017 IEEE 1st Ukr. Conf. Electr. Comput. Eng. UKRCON 2017 - Proc., 900–903
Martínez-Garcia A, Morris S, Tscholl M, Tracy F, Carmichael P (2012) Case-based learning, pedagogical innovation, and semantic web technologies. IEEE Trans Learn Technol 5(2):104–116
Jain A, Shakya A, Khatter H, Gupta AK (2019) A smart System for Fake News Detection Using Machine Learning. Intl Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT) 2019:1–4. https://doi.org/10.1109/ICICT46931.2019.8977659
Manzoor SI, Singla J, Nikita (2019) Fake News Detection Using Machine Learning approaches: A systematic Review, 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), 230–234, doi: https://doi.org/10.1109/ICOEI.2019.8862770.
Khanam Z, Alwasel B, Sirafi H, Rashid M (2021) Fake News Detection Using Machine Learning Approaches. IOP Conference Series: Mater Sci Eng 1099:012040. https://doi.org/10.1088/1757-899X/1099/1/012040
Allcott H, Gentzkow M (2017) Social media and fake news in the 2016 election. J Econ Perspect 31(2):211–236
Mathieu Cliche (2014) The sarcasm detector
William Yang Wang (2017) liar, liar pants on fire: A new benchmark dataset for fake news detection. ar**v preprint ar**v:1705.00648
Shu K, Sliva A, Wang S, Tang J, Liu H (2017) Fake news detection on social media. ACM SIGKDD Explorations Newsl 19(1):22–36
Vosoughi S, Roy D, Aral S (2018) )e spread of true and false news online. Science 359(6380):1146–1151
Rubin VL, Conroy N, Chen Y, Cornwell S (2016) Fake news or truth? using satirical cues to detect potentially misleading news, in Proceedings of the Second Workshop on Computational Approaches to Deception Detection, pp. 7–17, San Diego, CA, USA
Jwa H, Oh D, Park K, Kang JM, Lim H (2019) exBAKE: automatic fake news detection model based on bidirectional encoder representations from transformers (bert), Appl Sci, 9(19)
Reddy A, Vasundhara D, Subhash P (2019) Sentiment Research on Twitter Data. Int J Recent Technol Eng 8:1068–1070
Eshan SC and Hasan MS (2017) An application of machine learning to detect abusive bengali text. In Proceedings of the 2017 20th International Conference of Computer and Information Technology (ICCIT), Dhaka, Bangladesh, 22–24 December 2017; IEEE: New York, NY, USA, 1–6
Zhang W, Yoshida T, Tang X (2011) A comparative study of TF* IDF, LSI and multi-words for text classification. Expert Syst Appl 38:2758–2765
AnithaElavarasi S, Jayanthi J, Basker N (2021) A comparative study on logistic regression and svm based machine learning approach for analyzing user reviews. Turk J Physiother Rehabil 32:3564–3570
Donges, N. He Random Forest Algorithm (2021) Available online: https://builtin.com/data-science/random-forest-algorithm (accessed on 2 August 2022).
Safavian SR, Landgrebe D (1991) A survey of decision tree classifier methodology. IEEE Trans Syst Man Cybern 213:660–674
Zainuddin N, Selamat A (2014) Sentiment analysis using support vector machine. In Proceedings of the 2014 International Conference on Computer, Communications, and Control Technology (I4CT), Langkawi, Malaysia, 2–4 September 2014 IEEE: New York, NY, USA, 333–337
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Author Pankaj Kumar Varshney declares that he has no conflict of interest. Author Ganesh Kumar Wadhwani declares that he has no conflict of.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Varshney, P.K., Wadhwani, G.K. Systematic approach for fake news detection using machine learning. Multimed Tools Appl (2023). https://doi.org/10.1007/s11042-023-17913-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-023-17913-2