Log in

Finding the reference text in citation contexts using attention model

  • Special Issue Paper
  • Published:
Service Oriented Computing and Applications Aims and scope Submit manuscript

Abstract

Precise reference text extraction from citation contexts (CCs) is important in computational linguistics and information retrieval applications. The extraction of CCs forms the basis of critical tasks like citation network analysis and literature search/recommendation systems, all of which hinge on the fidelity of extracted reference information. However, traditional methods, often relying on full sentences or fixed-window approaches, suffer from unnecessary information inclusion and negatively affecting accuracy. In this study, we aim to bridge this gap by introducing a novel deep learning approach utilizing an Attention Model to directly extract reference text from CCs. This eliminates the need to sift through lengthy surrounding text. The model was trained on a dataset consisting of 100 cited papers from the fields of Natural Language Processing and Computational Linguistics. Each paper was converted into a text file and sequentially loaded for training. Tokenization was applied to convert textual data into numerical values. In the training phase, we used the Adam optimizer and adjusted batch sizes based on the number of citations in each paper. The proposed model obtained promising results, achieving a macro-F1 score of 0.87. This demonstrates its superior performance compared to standard benchmark techniques such as conditional random field and dependency parsing. Additionally, our model outperformed the benchmark techniques in terms of precision, recall, and F-score.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Canada)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. https://aan.how/.

  2. https://aan.how/about/.

References

  1. Jebari C, Herrera-Viedma E, Cobo MJ (2023) Context-aware citation recommendation of scientific papers: comparative study, gaps and trends. Scientometrics 1288:4243–4268

    Article  Google Scholar 

  2. Suganya E, Vijayarani S (2021) Firefly optimization algorithm based web scra** for web citation extraction. Wirel Pers Commun 1182:1481–1505

    Article  Google Scholar 

  3. Caragea C, Bulgarov F, Godea A, Gollapalli SD (2014) Citation-enhanced keyphrase extraction from research papers: a supervised approach. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1435–1446

  4. Liu S, Chen C (2013) The differences between latent topics in abstracts and citation contexts of citing papers. J Am Soc Inform Sci Technol 643:627–639

    Article  Google Scholar 

  5. Liang Y, Li Q (2011) Finding relevant papers based on citation relations. In: Web-age information management: 12th international conference, WAIM 2011, Wuhan, China, September 14–16, 2011. Proceedings 12. Springer, Berlin, pp 403–414

  6. Bertin M, Jonin P, Armetta F, Atanassova I (2019) Identifying the conceptual space of citation contexts using coreferences. In: 4th joint workshop on bibliometric-enhanced information retrieval and natural language processing for digital libraries (BIRNDL 2019) at the 42nd international ACM SIGIR conference on research and development in information retrieval, vol 2414. CEUR-WS, pp 138–144

  7. Cohan A, Goharian N (2017) Scientific article summarization using citation-context and article’s discourse structure. ar**v:1704.06619

  8. Jha R, Jbara A-A, Qazvinian V, Radev DR (2017) NLP-driven citation analysis for scientometrics. Nat Lang Eng 231:93–130

    Article  Google Scholar 

  9. Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. ar**v:1409.0473

  10. Garfield E (1965) Can citation indexing be automated. In: Statistical association methods for mechanized documentation, symposium proceedings, vol 269. Citeseer, pp 189–192

  11. Weinatoek M (1971) Citation indexes. Encycl Libr Inf Sci 5:16–40

    Google Scholar 

  12. Wolfram D (2016) Bibliometrics, information retrieval and natural language processing: natural synergies to support digital library research. In: Proceedings of the joint workshop on bibliometric-enhanced information retrieval and natural language processing for digital libraries (BIRNDL), pp 6–13

  13. Moro R, Vangel M, Bielikova M (2016) Identification of navigation lead candidates using citation and co-citation analysis. In: SOFSEM 2016: theory and practice of computer science: 42nd international conference on current trends in theory and practice of computer science, Harrachov, Czech Republic, January 23–28, 2016, proceedings 42. Springer, Berlin, pp 556–568

  14. Bingol O.H, Doslu M (2018) Content sensitive document ranking method by analyzing the citation contexts. Google Patents. US Patent 10,157,225

  15. Qazvinian V, Radev D (2010) Identifying non-explicit citing sentences for citation-based summarization. In: Proceedings of the 48th annual meeting of the association for computational linguistics, pp 555–564

  16. Abu-Jbara A, Radev D (2011) Coherent citation-based summarization of scientific papers. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, pp 500–509

  17. Jha R, Jbara A-A, Qazvinian V, Radev DR (2017) NLP-driven citation analysis for scientometrics. Nat Lang Eng 231:93–130

    Article  Google Scholar 

  18. Ma S, Zhang C, Liu X (2020) A review of citation recommendation: from textual content to enriched context. Scientometrics 122:1445–1472

    Article  Google Scholar 

  19. Dum, D, Sutton C, Klein, E (2016) Context matters: towards extracting a citation’s context using linguistic features. In: Proceedings of the 16th ACM/IEEE-CS on joint conference on digital libraries, pp 201–202

  20. Ebesu T, Fang Y (2017) Neural citation network for context-aware citation recommendation. In: Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval, pp 1093–1096

  21. Tuarob S, Mitra P, Giles CL (2013) A classification scheme for algorithm citation function in scholarly works. In: Proceedings of the 13th ACM/IEEE-CS joint conference on digital libraries, pp 367–368

  22. Li X, He Y, Meyers A, Grishman R (2013) Towards fine-grained citation function classification. In: Proceedings of the international conference recent advances in natural language processing RANLP 2013, pp 402–407

  23. Pride D, Knoth P (2017) Incidental or influential?-challenges in automatically detecting citation importance using publication full texts. In: Research and advanced technology for digital libraries: 21st international conference on theory and practice of digital libraries, TPDL 2017, Thessaloniki, Greece, September 18–21, 2017, Proceedings 21. Springer, Berlin, pp 572–578

  24. Valenzuela M, Ha V, Etzioni O (2015) Identifying meaningful citations. In: AAAI workshop: scholarly big data, vol 15, p 13

  25. Ahmad I, Alqarni MA, Almazroi AA, Tariq A (2020) Experimental evaluation of clickbait detection using machine learning models. Intell Autom Soft Comput 266:1335–1344

    Article  Google Scholar 

  26. Zhang H, Song H, Li S, Zhou M, Song D (2023) A survey of controllable text generation using transformer-based pre-trained language models. ACM Comput Surv 563:1–37

    Google Scholar 

  27. Deng L, Yu D (2014) Deep learning: methods and applications. Found Trends Signal Process 73–4:197–387

    Article  MathSciNet  Google Scholar 

  28. Hussain T, Yang B, Rahman H.U, Iqbal A, Ali F, shah B (2022) Improving source location privacy in social internet of things using a hybrid phantom routing technique. Comput Secur 123:102917. https://doi.org/10.1016/j.cose.2022.102917

    Article  Google Scholar 

  29. Qazi UK, Ahmad I, Minallah N, Zeeshan M (2023) Classification of tobacco using remote sensing and deep learning techniques. Agron J. https://doi.org/10.1002/agj2.21382

    Article  Google Scholar 

  30. Batmaz Z, Yurekli A, Bilge A, Kaleli C (2019) A review on deep learning for recommender systems: challenges and remedies. Artif Intell Rev 52:1–37

    Article  Google Scholar 

  31. Jeong C, Jang S, Park E, Choi S (2020) A context-aware citation recommendation model with bert and graph convolutional networks. Scientometrics 124:1907–1922

    Article  Google Scholar 

  32. Nicholson J.M, Mordaunt M, Lopez P, Uppala A, Rosati D, Rodrigues N.P, Grabitz P, Rife S.C (2021) Scite: a smart citation index that displays the context of citations and classifies their intent using deep learning. Quant Sci Stud 23:882–898

    Article  Google Scholar 

  33. Liu J, **a F, Feng X, Ren J, Liu H (2022) Deep graph learning for anomalous citation detection. IEEE Trans Neural Netw Learn Syst 336:2543–2557

    Article  Google Scholar 

  34. Roy SS, Mercer RE (2022) Biocite: a deep learning-based citation linkage framework for biomedical research articles. In: Proceedings of the 21st workshop on biomedical language processing, pp 241–251

  35. Anderson M.H, Lemken R.K (2023) Citation context analysis as a method for conducting rigorous and impactful literature reviews. Organ Res Methods 261:77–106

    Article  Google Scholar 

  36. Gao T, Yen H, Yu J, Chen D (2023) Enabling large language models to generate text with citations. ar**v:2305.14627

  37. Kraemer HC (2014) Kappa coefficient. Wiley StatsRef: statistics reference online, pp 1–4

  38. McHugh ML (2012) Interrater reliability: the kappa statistic. Biochem Med 223:276–282

    Article  Google Scholar 

  39. Loye G (2019) Attention mechanism. https://blog.floydhub.com/attention-mechanism/ [Accessed: (Use the date of access)]

  40. Luong T, Pham H, Manning CD, Màrquez L, Callison-Burch C, Su J, Pighin D (2015) Effective approaches to attention-based neural machine translation. In: Marton Y (ed) Proceedings of the 2015 conference on empirical methods in natural language processing, EMNLP 2015, Lisbon, Portugal, September 17–21, 2015. The Association for Computational Linguistics, pp 1412–1421. https://doi.org/10.18653/V1/D15-1166

  41. botherrefKeras (2024) KerasNLP Tokenizers. https://keras.io/api/keras_nlp/tokenizers/

  42. Khalid A, Alam F, Ahmed I (2018) Extracting reference text from citation contexts. Clust Comput 21:605–622

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Iftikhar Ahmed.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khan, D., Ahmed, I., Ullah, I. et al. Finding the reference text in citation contexts using attention model. SOCA (2024). https://doi.org/10.1007/s11761-024-00410-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11761-024-00410-1

Keywords

Navigation