Finding the reference text in citation contexts using attention model

Khan, Dilawar; Ahmed, Iftikhar; Ullah, Inam; Alwabli, Abdullah

doi:10.1007/s11761-024-00410-1

Special Issue Paper
Published: 22 May 2024

Service Oriented Computing and Applications (2024)

Dilawar Khan¹,
Iftikhar Ahmed¹,² (ORCID: orcid.org/0000-0001-7863-3746),
Inam Ullah³ &
Abdullah Alwabli⁴

Abstract

Precise extraction of reference text from citation contexts (CCs) is important in computational linguistics and information retrieval. It underpins tasks such as citation network analysis and literature search and recommendation systems, all of which depend on the fidelity of the extracted reference information. Traditional methods, which often rely on full sentences or fixed-size windows, include unnecessary information, and this reduces accuracy. In this study, we aim to bridge this gap by introducing a deep learning approach that uses an attention model to extract the reference text directly from CCs, eliminating the need to sift through lengthy surrounding text. The model was trained on a dataset of 100 cited papers from the fields of Natural Language Processing and Computational Linguistics. Each paper was converted into a text file and loaded sequentially for training, and tokenization was applied to convert the text into numerical values. During training, we used the Adam optimizer and adjusted the batch size according to the number of citations in each paper. The proposed model achieved a macro-F1 score of 0.87 and outperformed standard benchmark techniques, such as conditional random fields and dependency parsing, in terms of precision, recall, and F-score.
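For readers unfamiliar with the reported metric, the following is a minimal sketch of how a macro-F1 score can be computed. It assumes, purely for illustration, that each token in a citation context carries a binary label (1 if it belongs to the reference text, 0 otherwise); the abstract does not state the labeling scheme, and the function name `macro_f1` and the sample labels are hypothetical rather than taken from the paper.

```python
from collections import Counter


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over all labels seen in gold or pred."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        raise ValueError("at least one label is required")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1   # correct prediction for class g
        else:
            fp[p] += 1   # class p was predicted but is wrong
            fn[g] += 1   # class g was missed

    f1_scores = []
    for label in sorted(set(gold) | set(pred)):
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        f1_scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Hypothetical token labels: 1 = token is part of the reference text, 0 = it is not.
gold = [0, 1, 1, 1, 0, 0, 1, 0]
pred = [0, 1, 1, 0, 0, 1, 1, 0]
score = macro_f1(gold, pred)
print(f"macro-F1 = {score:.2f}")
```

Because macro averaging weights every class equally, a model cannot score well by favoring the more frequent class alone, which matters when reference-text tokens are a minority of the tokens in a citation context.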

Author information

Authors and Affiliations

Department of Computer Science and Information Technology, University of Engineering and Technology, Peshawar, 2500, Pakistan
Dilawar Khan & Iftikhar Ahmed
Department of Software Engineering, University of Europe for Applied Sciences, 14469, Potsdam, Germany
Iftikhar Ahmed
College of Mechatronics and Control Engineering, Shenzhen University, Shenzhen, 518060, China
Inam Ullah
Department of Electrical Engineering, College of Engineering and Computing in Al-Qunfudhah, Umm al-Qura University, 24231, Makkah, Saudi Arabia
Abdullah Alwabli

Corresponding author

Correspondence to Iftikhar Ahmed.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Khan, D., Ahmed, I., Ullah, I. et al. Finding the reference text in citation contexts using attention model. SOCA (2024). https://doi.org/10.1007/s11761-024-00410-1

Received: 08 February 2024
Revised: 20 April 2024
Accepted: 10 May 2024
Published: 22 May 2024
DOI: https://doi.org/10.1007/s11761-024-00410-1
