Log in

Systematic approach for fake news detection using machine learning

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Thanks to the advent of W3C and the rapid proliferation of social media platforms (Facebook, Instagram, Twitter, etc.), an unprecedented level of knowledge sharing has become possible. Users are producing more n more information and it is being circulating to a large number of people. The produced information may not be correct all the time. Even subject matter experts need to judge an array of variables before evaluating to check the trustworthiness. In this study, we propose to use machine learning (ML) to automate message classification. Our research examines various criteria that can be used to differentiate between genuine materials and counterfeits. These qualities are used for training machine learning algorithms, and we then use datasets from the real world to evaluate how well they work. The result shows that Xgboost model gives the highest accuracy for fake news detection in both Tf-Idf and BOW feature extraction techniques.SVM and Multinomial Naïve Base models are the most underperforming models in Tf-Idf and BOW respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Douglas A (2006) News consumption and the new electronic media. 7e Intl J Press/Politics 11(1):29–52

    Article  Google Scholar 

  2. Wong J (2016) Almost all the traffic to fake news sites is from facebook, new data show

  3. Lazer DMJ, Baum MA, Benkler Y et al (2018) The science of fake news. Science 359(6380):1094–1096

    Article  Google Scholar 

  4. S. A. Garc´ıa, G. G. Garc´ıa, M. S. Prieto, A. J. M. Guerrero, and C. R. Jim´enez, “)e impact of term fake news on the scientific community scientific performance and map** in web of science,” Social Sciences, 9(5) 2020

  5. A. Douglas, “News consumption and the new electronic media,” 7e International Journal of Press/Politics, vol. 11, no. 1, pp. 29–52, 2006.

  6. Granik M, Mesyura V (2017) Fake news detection using naive Bayes classifier,” 2017 IEEE 1st Ukr. Conf. Electr. Comput. Eng. UKRCON 2017 - Proc., 900–903

  7. Martínez-Garcia A, Morris S, Tscholl M, Tracy F, Carmichael P (2012) Case-based learning, pedagogical innovation, and semantic web technologies. IEEE Trans Learn Technol 5(2):104–116

    Article  Google Scholar 

  8. Jain A, Shakya A, Khatter H, Gupta AK (2019) A smart System for Fake News Detection Using Machine Learning. Intl Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT) 2019:1–4. https://doi.org/10.1109/ICICT46931.2019.8977659

    Article  Google Scholar 

  9. Manzoor SI, Singla J, Nikita (2019) Fake News Detection Using Machine Learning approaches: A systematic Review, 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), 230–234, doi: https://doi.org/10.1109/ICOEI.2019.8862770.

  10. Khanam Z, Alwasel B, Sirafi H, Rashid M (2021) Fake News Detection Using Machine Learning Approaches. IOP Conference Series: Mater Sci Eng 1099:012040. https://doi.org/10.1088/1757-899X/1099/1/012040

    Article  Google Scholar 

  11. Allcott H, Gentzkow M (2017) Social media and fake news in the 2016 election. J Econ Perspect 31(2):211–236

    Article  Google Scholar 

  12. Mathieu Cliche (2014) The sarcasm detector

  13. William Yang Wang (2017) liar, liar pants on fire: A new benchmark dataset for fake news detection. ar**v preprint ar**v:1705.00648

  14. Shu K, Sliva A, Wang S, Tang J, Liu H (2017) Fake news detection on social media. ACM SIGKDD Explorations Newsl 19(1):22–36

    Article  Google Scholar 

  15. Vosoughi S, Roy D, Aral S (2018) )e spread of true and false news online. Science 359(6380):1146–1151

    Article  Google Scholar 

  16. Rubin VL, Conroy N, Chen Y, Cornwell S (2016) Fake news or truth? using satirical cues to detect potentially misleading news, in Proceedings of the Second Workshop on Computational Approaches to Deception Detection, pp. 7–17, San Diego, CA, USA

  17. Jwa H, Oh D, Park K, Kang JM, Lim H (2019) exBAKE: automatic fake news detection model based on bidirectional encoder representations from transformers (bert), Appl Sci, 9(19)

  18. Reddy A, Vasundhara D, Subhash P (2019) Sentiment Research on Twitter Data. Int J Recent Technol Eng 8:1068–1070

    Google Scholar 

  19. Eshan SC and Hasan MS (2017) An application of machine learning to detect abusive bengali text. In Proceedings of the 2017 20th International Conference of Computer and Information Technology (ICCIT), Dhaka, Bangladesh, 22–24 December 2017; IEEE: New York, NY, USA, 1–6

  20. Zhang W, Yoshida T, Tang X (2011) A comparative study of TF* IDF, LSI and multi-words for text classification. Expert Syst Appl 38:2758–2765

    Article  Google Scholar 

  21. AnithaElavarasi S, Jayanthi J, Basker N (2021) A comparative study on logistic regression and svm based machine learning approach for analyzing user reviews. Turk J Physiother Rehabil 32:3564–3570

    Google Scholar 

  22. Donges, N. He Random Forest Algorithm (2021) Available online: https://builtin.com/data-science/random-forest-algorithm (accessed on 2 August 2022).

  23. Safavian SR, Landgrebe D (1991) A survey of decision tree classifier methodology. IEEE Trans Syst Man Cybern 213:660–674

    Article  MathSciNet  Google Scholar 

  24. Zainuddin N, Selamat A (2014) Sentiment analysis using support vector machine. In Proceedings of the 2014 International Conference on Computer, Communications, and Control Technology (I4CT), Langkawi, Malaysia, 2–4 September 2014 IEEE: New York, NY, USA, 333–337

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pankaj Kumar Varshney.

Ethics declarations

Conflict of interest

Author Pankaj Kumar Varshney declares that he has no conflict of interest. Author Ganesh Kumar Wadhwani declares that he has no conflict of.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Varshney, P.K., Wadhwani, G.K. Systematic approach for fake news detection using machine learning. Multimed Tools Appl (2023). https://doi.org/10.1007/s11042-023-17913-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11042-023-17913-2

Keywords

Navigation