Named Entity Recognition in Natural Language Processing: A Systematic Review

  • Conference paper
  • First Online:
Proceedings of Second Doctoral Symposium on Computational Intelligence

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1374))

Abstract

The enormous growth and availability of data poses a great challenge for extracting useful information from documents written in natural language. The information extraction task has become a vital activity in all domains. The process of identifying the names of organization, people, locations or other entities in text is called named entity recognition (NER). It is the subtask and plays an important part to discover and classify the names such as organization name, person name or the location. This is one of the trending fields and most important step in the natural language processing (NLP) for analysis of text. Research on NER changed a lot in the recent decade. NER can consequently examine the entire articles and reveal the individuals, associations, and spots talked about in text. Knowing the applicable labels for every single article help in naturally arranging the articles in all around characterized progressive systems and endorse smooth content disclosure. The pretension of this paper is to present survey on NER. The prime contribution of this research to present state-of-the-art NER is systematically reviewed according to techniques used in NER. This paper also provides tools, datasets, techniques, challenges and future directions in the field of NER with the aim of providing researchers the substantial knowledge for further work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Nadeau, D., & Sekine, S. (2007). A survey of named entity recognition and classification. Lingvisticae Investigationes, 30(1), 3–26.

    Article  Google Scholar 

  2. Petasis, G., Cucchiarelli, A., Velardi, P., Paliouras, G., Karkaletsis, V., & Spyropoulos, C. D. (2000). Automatic adaptation of proper noun dictionaries through cooperation of machine learning and probabilistic methods. Proceedings of SIGIR, 128–135.

    Google Scholar 

  3. Li, J., Sun, A., Han, J., & Li, C. (2018). A survey on deep learning for named entity recognition. IEEE Transactions On Knowledge And Data Engineering, 20.

    Google Scholar 

  4. Marrero, M., Urbano, J., Sánchez-Cuadrado, S., Morato, J., & Gómez-Berbís, J. M. (2013). Named entity recognition: Fallacies, challenges and opportunities. Computer Standards & Interfaces, 35, 482–489.

    Article  Google Scholar 

  5. Weischedel, R., Hovy, E., Marcus, M., Palmer, M., Belvin, R., Pradhan, S., Ramshaw, L., & Xue, N. (2011). OntoNotes: A large training corpus for enhanced processing. In Handbook of natural language Processing and machine translation: DARPA global autonomous language exploitation. Springer.

    Google Scholar 

  6. Sang, E. F.T . K. (2002). Introduction to the conll-2002 shared task: Language-independent named entity recognition. In Proceedings of the 6th Conference on Natural Language Learning. Stroudsburg, PA, USA. Association for Computational Linguistics (Vol. 31, pp. 1–4).

    Google Scholar 

  7. Kim, J. D., & Ohta, T. (2003). GENIA corpus-a semantically annotated corpus for bio-textmining (Vol. 19).

    Google Scholar 

  8. Grishman, R., & Sundheim, B. (1996). Message understanding conference-6: A brief history. In Proceedings of the 16th Conference on Computational Linguistics, COLING (Vol. 1, pp. 466–471).

    Google Scholar 

  9. Bird, S., Loper, E., & Klein, E. (2009). Natural language processing with python (Vol. 36, pp. 767–771). O’Reilly Media Inc.

    Google Scholar 

  10. Al-Rfou, R., Kulkarni, V., & Perozzi, B. (2014). POLYGLOT-NER: Massive multilingual named entity recognition (Vol. 1).

    Google Scholar 

  11. Manning, C., Surdeanu, M., & Bauer, J. (2014). The Stanford CoreNLP natural language processing toolkit. Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations (pp. 55–60).

    Google Scholar 

  12. Kang, Y., Cai, Z., Tan, C.-W., Huang, Q., & Liu, H. (2020). Natural language processing (NLP) in management research: A literature review. Journal of Management Analytics, 7(2), 139–172.

    Article  Google Scholar 

  13. Gardner, M., Grus, J., Neumann, M., Tafjord, O., Dasigi, P., Liu, N., Peters, M., Schmitz, M., & Zettlemoyer, L. (2017). AllenNLP: A deep semantic natural language processing platform. In Proceedings of Workshop for NLP Open Source Software (NLP-OSS), Technical report (pp. 1–6,).

    Google Scholar 

  14. Neumann, M., & King, D. (2019). ScispaCy: Fast and robust models for biomedical natural language processing. In Proceedings of the BioNLP workshop, 319–327.

    Google Scholar 

  15. Goyal, A., Gupta, V., & Kumar, M. (2018). Recent named entity recognition and classification techniques: A systematic review. Computer Science Review, 29, 21–43.

    Article  Google Scholar 

  16. Lin, B. Y., Xu, F., Luo, Z., & Zhu, K. (2017). Multi-channel bilstm-crf model for emerging named entity recognition in social media. In Proceedings of the 3rd Workshop on Noisy User-generated Text (pp. 160–165).

    Google Scholar 

  17. Peters, M. E., Ammar, W., Bhagavatula, C., & Power, R. (2017). Semisupervised sequence tagging with bidirectional language models. In Proceedings of ACL (pp. 1756–1765).

    Google Scholar 

  18. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., & Kuksa, P. (2011). Natural language processing (almost) from scratch. Journal of Machine Learning Research, 12, 2493–2537.

    MATH  Google Scholar 

  19. Ju, M., Miwa, M., & Ananiadou, S. (2018). A neural layered model for nested named entity recognition. Proceedings of NAACL-HLT, 1, 1446–1459.

    Google Scholar 

  20. Yang, Z., Salakhutdinov, R., & Cohen, W. (2016). Multi-task cross-lingual sequence tagging from scratch. ar**v. 2.

    Google Scholar 

  21. Rei, M. (2017). Semi-supervised multitask learning for sequence labeling. Proceedings of ACL (pp. 2121–2130).

    Google Scholar 

  22. Nadeau, D., Turney, P. D., & Matwin, S. (2006). Unsupervised named entity recognition: Generating gazetteers and resolving ambiguity. In Proceedings of the Canadian Society for Computational Studies of Intelligence (pp. 266–277). Springer.

    Google Scholar 

  23. Zhang, S., & Elhadad, N. (2013). Unsupervised biomedical named entity recognition: Experiments with clinical and biological texts. Journal of Biomedical Information, 46, 1088–1098.

    Article  Google Scholar 

  24. Däniken, P. V., & Cieliebak, M. (2017)T. ransfer learning and sentence level features for named entity recognition on tweets. In Proceedings of the 3rd Workshop on Noisy User-generated Text (pp. 166–171).

    Google Scholar 

  25. Zhao, H., Yang, Y., Zhang, Q., & Si, L. (2018). Improve neural entity recognition via multi-task data selection and constrained decoding. NAACL-HLT, 2, 346–351.

    Google Scholar 

  26. Sutton, C., McCallum, A., & Rohanimanesh, K. (2007). Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data. Journal of Machine Learning Research, 8, 693–723.

    MATH  Google Scholar 

  27. Lin, B. Y., & Lu, W. (2018). Neural adaptation layers for cross-domain named entity recognition. Proceedings of AAAI, 12, 2012–2022.

    Google Scholar 

  28. Tomori, S., Ninomiya, T., & Mori, S. (2016). Domain specific named entity recognition referring to the real world by deep neural networks. Proceedings of ACL, 2, 236–242.

    Google Scholar 

  29. Yadav, V., & Bethard, S. (2018). A survey on recent advances in named entity recognition from deep learning models. In Proceedings of COLING (pp. 2145–2158).

    Google Scholar 

  30. Rei, M., Crichton, G. K., & Pyysalo, S. (2016). Attending to characters in neural sequence labeling models. In Proceedings of COLING (pp. 309–318).

    Google Scholar 

  31. Zukov-Gregoric, A., Bachrach, Y., Minkovsky, P., Coope, S., & Maksak, B. (2017). Neural named entity recognition using a selfattention mechanism. In Proceedings of ICTAI (pp. 652–656).

    Google Scholar 

  32. Xu, G., Wang, C., & He, X. (2018). Improving clinical named entity recognition with global neural attention. In Proceedings of APWeb-WAIM (pp. 264–279).

    Google Scholar 

  33. Zhang, Q., Fu, J., Liu, X., & Huang, X. (2018). Adaptive co-attention network for named entity recognition in tweets. In AAAI.

    Google Scholar 

  34. Batmaz, Z., Yurekli, A., Bilge, A., & Kaleli, C. (2018). A review on deep learning for recommender systems: Challenges and remedies. Artificial Intelligence Review, 1–37.

    Google Scholar 

  35. Akbik, A., Blythe, D., & Vollgraf, R. (2018). Contextual string embeddings for sequence labeling. In Proceedings of COLING (pp. 1638–1649).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharma, A., Amrita, Chakraborty, S., Kumar, S. (2022). Named Entity Recognition in Natural Language Processing: A Systematic Review. In: Gupta, D., Khanna, A., Kansal, V., Fortino, G., Hassanien, A.E. (eds) Proceedings of Second Doctoral Symposium on Computational Intelligence . Advances in Intelligent Systems and Computing, vol 1374. Springer, Singapore. https://doi.org/10.1007/978-981-16-3346-1_66

Download citation

Publish with us

Policies and ethics

Navigation