Log in

DHMFRD – TER: a deep hybrid model for fake review detection incorporating review texts, emotions, and ratings

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Recently, there has been an increasing reward to manipulate product/ service reviews, mostly profit-driven, since positive reviews infer high business returns and vice versa. To combat this issue, experts in industry and researchers recently attempted integrating multi-aspect (reviewer- and review-centric) data features. However, the emotions hidden in the review, the semantic meaning of the review, and data heterogeneity still deserve more study as they are essential indicators of fake content. This study proposed a Deep Hybrid Model for Fake Review Detection incorporating review Texts, Emotions, and Ratings (DHMFRD – TER). Initially, it computes contextualized review text vectors and extraction of emotion indicators representations. Then, the model learns the representation to extract higher-level review features. Finally, contextualized word vectors, ratings, and emotions are concatenated; such a multidimensional feature representation is used to classify reviews. Extensive experiments on three publicly available datasets demonstrate that DHMFRD–TER significantly outperforms state-of-the-art baseline approaches, achieving an accuracy of 0.988, 0.987, and 0.994 in Amazon, Yelp CHI, and OSF datasets, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. https://www.kaggle.com/lievgarcia/amazon-reviews

  2. http://odds.cs.stonybrook.edu/yelpchi-dataset/

  3. https://osf.io/tyue9/

  4. https://code.google.com/archive/p/stop-words/

References

  1. Khan ZY, Niu Z, Yousif A (2020) Joint Deep Recommendation Model Exploiting Reviews and Metadata Information. Neurocomputing 402:256–265. https://doi.org/10.1016/j.neucom.2020.03.075

    Article  Google Scholar 

  2. Manaskasemsak, B, Tantisuwankul, J, Rungsawang, A (2021) Fake review and reviewer detection through behavioral graph partitioning integrating deep neural network. Neural Comput Appl, 1. https://doi.org/10.1007/s00521-021-05948-1

  3. Duma R, Niu Z, Nyamawe A et al (2023) A Deep Hybrid Model for Fake review detection by jointly leveraging review text, overall ratings and aspects ratings. Soft Comput 1–24. https://doi.org/10.1007/s00500-023-07897-4

  4. Liu, Y, Wang, L, Shi, T, Li, J (2022) Detection of spam reviews through a hierarchical attention architecture with N-gram CNN and Bi-LSTM. Inf Syst, 103(xxxx). https://doi.org/10.1016/j.is.2021.101865

  5. Ren J, Yeoh W, Shan Ee M, Popovič A (2018) Online consumer reviews and sales: Examining the chicken-egg relationships. J Am Soc Inf Sci 69(3):449–460. https://doi.org/10.1002/asi.23967

    Article  Google Scholar 

  6. Bhuvaneshwari P, Rao AN, Robinson YH (2021) Spam review detection using self attention based CNN and bi-directional LSTM. Multimedia Tools and Applications 80(12):18107–18124. https://doi.org/10.1007/s11042-021-10602-y

    Article  Google Scholar 

  7. Luca M (2016) Reviews, reputation, and revenue: The Case of Yelp.com. Com (March 15, 2016) Harvard Bus Sch NOM Unit Work Pap 12–016

  8. Ellson, A (2018) 'A third of TripAdvisor reviews are fake' as cheats buy five stars | The Times, Accsessed December, 20 2021. https://www.thetimes.co.uk/article/hotel-and-caf-cheats-are-caught-trying-to-buy-tripadvisor-stars-027fbcwc8

  9. Salminen, J, Kandpal, C, Kamel, AM, Jung, SG, Jansen, BJ (2022) Creating and detecting fake reviews of online products. J Retail Consum Serv, 64(August 2021). https://doi.org/10.1016/j.jretconser.2021.102771

  10. Gretzel, U (2009) Information and Communication Technologies in Tourism 2009. Inf Commun Technol Tourism 2009. https://doi.org/10.1007/978-3-211-93971-0

  11. Harris CG (2012) Detecting deceptive opinion spam using human computation. Work Twenty-Sixth AAAI Conf Artif Intell WS-12-08:87–93

  12. Ott, M, Cardie, C, Hancock, JT (2013) Negative deceptive opinion spam. NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference, June, 497–501

  13. Barushka A, Hajek P (2020) Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks. Neural Comput Appl 32(9):4239–4257. https://doi.org/10.1007/s00521-019-04331-5

    Article  Google Scholar 

  14. Zhang W, Du Y, Yoshida T, Wang Q (2018) DRI-RCNN: An approach to deceptive review identification using recurrent convolutional neural network. Inf Process Manage 54(4):576–592. https://doi.org/10.1016/j.ipm.2018.03.007

    Article  Google Scholar 

  15. Heydari A, Tavakoli MA, Salim N, Heydari Z (2015) Detection of review spam: A survey. Expert Syst Appl 42(7):3634–3642. https://doi.org/10.1016/j.eswa.2014.12.029

    Article  Google Scholar 

  16. Dong L, yu, Ji, S. juan, Zhang, C. **, Zhang, Q., Chiu, D. K. W., Qiu, L. qing, & Li, D. (2018) An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews. Expert Syst Appl 114:210–223. https://doi.org/10.1016/j.eswa.2018.07.005

    Article  Google Scholar 

  17. Ghai R, Kumar S, Pandey AC (2019) Spam detection using rating and review processing method. Adv Intell Syst Comput 670:189–198. https://doi.org/10.1007/978-981-10-8971-8_18

    Article  Google Scholar 

  18. Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: An empirical study. Inf Sci 385–386:213–224. https://doi.org/10.1016/j.ins.2017.01.015

    Article  Google Scholar 

  19. Zhang, Y, Wallace, B (2015) A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional Neural Networks for Sentence Classification. http://arxiv.org/abs/1510.03820

  20. Barbado R, Araque O, Iglesias CA (2019) A framework for fake review detection in online consumer electronics retailers. Inf Process Manage 56(4):1234–1244. https://doi.org/10.1016/j.ipm.2019.03.002

    Article  Google Scholar 

  21. Budhi GS, Chiong R, Wang Z (2021) Resampling imbalanced data to detect fake reviews using machine learning classifiers and textual-based features. Multimed Tools Appl 80(9):13079–13097. https://doi.org/10.1007/s11042-020-10299-5

    Article  Google Scholar 

  22. Wang, X, Liu, K, Zhao, J (2017) Handling cold-start problem in review spam detection by jointly embedding texts and behaviors. ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 1, 366–376. https://doi.org/10.18653/v1/P17-1034

  23. Li, Y, Feng, X, Zhang, S (2016) Detecting Fake Reviews Utilizing Semantic and Emotion Model. Proceedings - 2016 3rd International Conference on Information Science and Control Engineering, ICISCE 2016, 317–320. https://doi.org/10.1109/ICISCE.2016.77

  24. Saini M, Sharan A (2017) Identifying Deceptive Opinion Spam using Aspect-based Emotions and Human Behavior Modeling. Int J Hybrid Inf Technol 10(1):447–456. https://doi.org/10.14257/ijhit.2017.10.1.36

    Article  Google Scholar 

  25. Lauriola I, Lavelli A, Aiolli F (2022) An introduction to deep learning in natural language processing: models, techniques, and tools. Neurocomputing 470:443–456. https://doi.org/10.1016/j.neucom.2021.05.103

  26. Khan ZY, Niu Z (2021) CNN with depthwise separable convolutions and combined kernels for rating prediction. Exp Syst Appl 170(November2020):114528. https://doi.org/10.1016/j.eswa.2020.114528

    Article  Google Scholar 

  27. **dal, N, Liu, B (2008) Opinion spam and analysis. WSDM'08 - Proceedings of the 2008 International Conference on Web Search and Data Mining, 219–229. https://doi.org/10.1145/1341531.1341560

  28. Mewada A, Dewang RK (2021) Research on false review detection Methods: A state-of-the-art review. J King Saud Univ Comput Inf Sci. https://doi.org/10.1016/j.jksuci.2021.07.021

    Article  Google Scholar 

  29. Vidanagama DU, Silva TP, Karunananda AS (2020) Deceptive consumer review detection: a survey. Artif Intell Rev 53(2):1323–1352. https://doi.org/10.1007/s10462-019-09697-5

    Article  Google Scholar 

  30. **dal, N, Liu, B (2007) Review spam detection. 16th International World Wide Web Conference, WWW2007, 1189–1190. https://doi.org/10.1145/1242572.1242759

  31. Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. ar**v Prepr ar**v11074557

  32. Li Jiwei, Ott M, Cardie C, Hovy E (2014) Towards a general rule for identifying deceptive opinion spam. 52nd Annual Meet Assoc Computa Linguist ACL 2014 Proceed Conf 1:1566–1576. https://doi.org/10.3115/v1/p14-1147

    Article  Google Scholar 

  33. Hajek P, Barushka A, Munk M (2020) Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining. Neural Comput Appl 32(23):17259–17274. https://doi.org/10.1007/s00521-020-04757-2

    Article  Google Scholar 

  34. Zhao, S, Xu, Z, Liu, L, Guo, M, Yun, J (2018) Towards Accurate Deceptive Opinions Detection Based on Word Order-Preserving CNN. Math Problems Eng, 2018. https://doi.org/10.1155/2018/2410206

  35. Li Jiandun, Lv P, **ao W, Yang L, Zhang P (2021) Exploring groups of opinion spam using sentiment analysis guided by nominated topics. Exp Syst Appl 171(January):114585. https://doi.org/10.1016/j.eswa.2021.114585

    Article  Google Scholar 

  36. Mukherjee A, Kumar A, Liu B, et al (2013) Spotting opinion spammers using behavioral footprints. Proc ACM SIGKDD Int Conf Knowl Discov Data Min Part F1288:632–640. https://doi.org/10.1145/2487575.2487580

  37. Fei G, Mukherjee A, Liu B, et al (2013) Exploiting burstiness in reviews for review spammer detection. Proc 7th Int Conf Weblogs Soc Media, ICWSM 2013 7:175–184

  38. Li H, Fei G, Wang S, Liu B, Shao W, Mukherjee A, Shao J (2017) Bimodal distribution and co-bursting in review spam detection. 26th International World Wide Web Conference. WWW 2017:1063–1072. https://doi.org/10.1145/3038912.3052582

    Article  Google Scholar 

  39. Li, F, Huang, M, Yang, Y, Zhu, X (2011) Learning to identify review spam. IJCAI Int Joint Conf Artif Intell, 2488–2493. https://doi.org/10.5591/978-1-57735-516-8/IJCAI11-414

  40. Lim, EP, Liu, B, Liu, B, Lauw, HW (2010) Detecting Product Review Spammers using Rating Behaviors. Proceedings of the 19th {ACM} Conference on Information and Knowledge Management, {CIKM} 2010, Toronto, Ontario, Canada, October 26–30, 2010 Management, {CIKM} 2010, Toronto, Ontario, Canada, October 26–30, 2010, 939--948. https://doi.org/10.1145/1871437.1871557

  41. Mukherjee, A, Liu, B, Glance, N (2012) Spotting fake reviewer groups in consumer reviews. WWW'12 - Proceedings of the 21st Annual Conference on World Wide Web, 191–200. https://doi.org/10.1145/2187836.2187863

  42. Melleng, A, Loughrey, AJ, Deepak, P (2019) Sentiment and emotion based text representation for fake reviews detection. International Conference Recent Advances in Natural Language Processing, RANLP, 2019-Septe(2019), 750–757. https://doi.org/10.26615/978-954-452-056-4_087

  43. Peng Q, Zhong M (2014) Detecting Spam Review through Sentiment Analysis. J Softw 9(8):2065–2072. https://doi.org/10.4304/jsw.9.8.2065-2072

    Article  Google Scholar 

  44. Asghar MZ, Ullah A, Ahmad S, Khan A (2020) Opinion spam detection framework using hybrid classification scheme. Soft Comput 24(5):3475–3498. https://doi.org/10.1007/s00500-019-04107-y

    Article  Google Scholar 

  45. Noekhah, S, Fouladfar, E, Salim, N (2014) A Novel Approach for Opinion Spam Detection in E-Commerce. 8th International Conference on E-Commerce with Focus on e-Trust

  46. Kokate, S, Tidke, B (2015) Fake Review and Brand Spam Detection using J48 Classifier. 6(4), 3523–3526

  47. Rajamohana SP, Umamaheswari K (2018) Hybrid approach of improved binary particle swarm optimization and shuffled frog lea** for feature selection. Comput Electr Eng 67:497–508. https://doi.org/10.1016/j.compeleceng.2018.02.015

    Article  Google Scholar 

  48. Jacob, MS, Rajendran, PS (2021) Fuzzy artificial bee colony-based CNN-LSTM and semantic feature for fake product review classification. February, 1–16. https://doi.org/10.1002/cpe.6539

  49. Devlin J, Chang MW, Lee K, Toutanova K (2018) BERT: Pre-training of deep bidirectional transformers for language understanding. ar**v Prepr ar**v181004805

  50. Kaliyar RK, Goswami A, Narang P (2021) FakeBERT: Fake news detection in social media with a BERT-based deep learning approach. Multimed Tools Appl 80(8):11765–11788. https://doi.org/10.1007/s11042-020-10183-2

    Article  Google Scholar 

  51. Palani B, Elango S, Viswanathan V (2021) CB - Fake : A multimodal deep learning framework for automatic fake news detection using capsule neural network and BERT (Issue 0123456789). Springer, US

    Google Scholar 

  52. Denecke, K (2008) Using SentiWordNet for multilingual sentiment analysis. Proceedings – Int Conf Data Eng, 507–512. https://doi.org/10.1109/ICDEW.2008.4498370

  53. Malik MSI, Hussain A (2017) Helpfulness of product reviews as a function of discrete positive and negative emotions. Comput Hum Behav 73(2010):290–302. https://doi.org/10.1016/j.chb.2017.03.053

    Article  Google Scholar 

  54. Bravo-Marquez F, Mendoza M, Poblete B (2014) Meta-level sentiment models for big social data analysis. Knowl-Based Syst 69(1):86–99. https://doi.org/10.1016/j.knosys.2014.05.016

    Article  Google Scholar 

  55. Hu, M, Liu, B (2004) Mining and summarizing customer reviews. KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 168–177. https://doi.org/10.1145/1014052.1014073

  56. Bravo-Marquez, F, Frank, E, Mohammad, SM, Pfahringer, B (2017) Determining Word-Emotion Associations from Tweets by Multi-label Classification. Proceedings - 2016 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2016, 536–539. https://doi.org/10.1109/WI.2016.0091

  57. Rehman AU, Malik AK, Raza B, Ali W (2019) A Hybrid CNN-LSTM Model for Improving Accuracy of Movie Reviews Sentiment Analysis. Multimed Tools Appl 78(18):26597–26613. https://doi.org/10.1007/s11042-019-07788-7

    Article  Google Scholar 

  58. Sagnika S, Mishra BSP, Meher SK (2021) An attention-based CNN-LSTM model for subjectivity detection in opinion-mining. Neural Comput Appl 33(24):17425–17438. https://doi.org/10.1007/s00521-021-06328-5

    Article  Google Scholar 

  59. Rayana, S, Akoglu, L (2015) Collective opinion spam detection: Bridging review networks and metadata. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015-Augus, 985–994. https://doi.org/10.1145/2783258.2783370

  60. Abadi M, Barham P, Chen J et al (2016) TensorFlow: a system for large-scale machine learning. Osdi 16:265–283

  61. Chollet F et al (2015) Keras: Deep learning for humans. https://github.com/fchollet/keras. Accessed 20 Jan 2022

  62. Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41. https://doi.org/10.1016/j.neucom.2016.10.080

    Article  Google Scholar 

  63. Li J, Fu Y, Xu R (2020) Improving Fake Product Detection with Aspect-Based Sentiment Analysis. Int Conf Cogn Comput, 39–49. https://doi.org/10.1007/978-3-030-59585-2_4

  64. Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MathSciNet  Google Scholar 

  65. García S, Fernández A, Luengo J, Herrera F (2010) Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power. Inf Sci 180(10):2044–2064. https://doi.org/10.1016/j.ins.2009.12.010

    Article  Google Scholar 

  66. Luo, N, Deng, H, Zhao, L, Liu, Y, Wang, X, Tan, Z (2017) Multi-aspect feature based neural network model in detecting fake reviews. Proceedings - 2017 4th International Conference on Information Science and Control Engineering, ICISCE 2017, 475–479. https://doi.org/10.1109/ICISCE.2017.106

Download references

Funding

This work was supported by the National Natural Science Foundation of China under Grant 62272048.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhendong Niu.

Ethics declarations

Conflicts of Interest

The authors declare that there is no conflict of interest in this Manuscript.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Duma, R.A., Niu, Z., Nyamawe, A. et al. DHMFRD – TER: a deep hybrid model for fake review detection incorporating review texts, emotions, and ratings. Multimed Tools Appl 83, 4533–4549 (2024). https://doi.org/10.1007/s11042-023-15193-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15193-4

Keywords

Navigation