Abstract
Accurate identification of named entities is pivotal for the advancement of sophisticated legal Artificial Intelligence (AI) applications. However, the legal domain presents distinct challenges due to the presence of fine-grained, domain-specific entities, including lawyers, judges, courts, and precedents. This necessitates a nuanced approach to Named Entity Recognition (NER).
In this paper, we introduce a novel NER approach tailored to the legal domain. Our system combines Robustly Optimized BERT (RoBERTa) with a Graph Convolutional Network (GCN) to harness two distinct types of complementary information related to words in the data. Furthermore, the application of a Conditional Random Field (CRF) at the output layer ensures global consistency in data labeling by considering the entire sequence when predicting a named entity. RoBERTa captures contextual information about individual words, while GCN allows us to exploit the mutual relationships between words, resulting in more precise named entity identification. Our results indicate that RoBERTa-GCN (CRF) outperforms other standard settings, such as, RoBERTa, textGCN, and BiLSTM, including state-of-the-art for NER in the legal domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)
Finkel, J.R., Manning, C.D.: Nested named entity recognition. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 141–150 (2009)
Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging. ar**v preprint ar**v:1508.01991 (2015)
Kalamkar, P., Agarwal, A., Tiwari, A., Gupta, S., Karn, S., Raghavan, V.: Named entity recognition in Indian court judgments. In: Proceedings of the Natural Legal Language Processing Workshop 2022, pp. 184–193. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Hybrid), December 2022. https://aclanthology.org/2022.nllp-1.15
Kuru, O., Can, O.A., Yuret, D.: Charner: Character-level named entity recognition. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 911–921 (2016)
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. ar**v preprint ar**v:1603.01360 (2016)
Leitner, E., Rehm, G., Moreno-Schneider, J.: A dataset of German legal documents for named entity recognition. In: Proceedings of the 22nd Nordic Conference on Computational Linguistics, pp. 381–386 (2019)
Marrero, M., Urbano, J., Sánchez-Cuadrado, S., Morato, J., Gómez-Berbís, J.M.: Named entity recognition: fallacies, challenges and opportunities. Comput. Stand. Interfaces 35(5), 482–489 (2013)
Pais, V., Mitrofan, M., Gasan, C.L., Coneschi, V., Ianov, A.: Named entity recognition in the romanian legal domain. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 157–168 (2019)
Reimers, N., Eckle-Kohler, J., Schnober, C., Kim, J., Gurevych, I.: Germeval-2014: Nested named entity recognition with neural networks. In: Proceedings of the 4th Workshop on Semantic Evaluation (SemEval-2014), pp. 244–250 (2014)
Ritter, A., Clark, S., Etzioni, O., et al.: Named entity recognition in tweets: an experimental study. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pp. 1524–1534 (2011)
Yao, L., Mao, C., Luo, Y.: Graph convolutional networks for text classification. ar**v preprint ar**v:2008.05959 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Jain, A., Sharma, R. (2024). Enhancing Legal Named Entity Recognition Using RoBERTa-GCN with CRF: A Nuanced Approach for Fine-Grained Entity Recognition. In: Goharian, N., et al. Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, vol 14610. Springer, Cham. https://doi.org/10.1007/978-3-031-56063-7_17
Download citation
DOI: https://doi.org/10.1007/978-3-031-56063-7_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56062-0
Online ISBN: 978-3-031-56063-7
eBook Packages: Computer ScienceComputer Science (R0)