Abstract
As one of the important subtasks of legal judgment prediction, charge prediction aims to predict the final charge according to the fact description of a legal case. It can help make legal judgments or provide professional legal guidance for non-professionals. Most existing works predict charges based only on the fact description of a legal case while ignoring the semantic information of charge labels. Moreover, because real-world data are imbalanced, they perform poorly on few-shot charges for which training data are scarce. To address these issues, we propose a novel legal text representation based on a pre-trained model for charge prediction, named joint label-enhanced representation (JLER), which provides abundant charge label information as additional legal knowledge for the pre-trained model to improve charge prediction performance. JLER improves prediction accuracy and interpretability by combining charge label information enhanced by double-layer attention with legal text information, and it relieves the impact of data imbalance by fine-tuning the pre-trained model from both the text feature side and the charge label side. Experimental results on two real-world datasets demonstrate that our proposed model achieves significant and consistent improvements over state-of-the-art baselines. Specifically, our model outperforms the baselines by about 13.9% accuracy on few-shot charge prediction. These results indicate that the proposed JLER model performs well for charge prediction and is expected to be applicable to other subtasks of legal judgment prediction.
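The abstract only sketches the architecture, so the following is a minimal PyTorch sketch of the general idea of a label-enhanced charge classifier. All names and dimensions here (LabelEnhancedClassifier, attn1, attn2, the GRU encoder) are illustrative assumptions and are not taken from the paper: a generic bidirectional GRU stands in for the actual pre-trained model, and two stacked attention layers stand in for the double-layer attention over charge labels.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LabelEnhancedClassifier(nn.Module):
    """Illustrative sketch: fuse a fact-description representation with a
    label-enhanced representation built from learnable charge-label embeddings
    refined by two stacked attention layers."""

    def __init__(self, vocab_size, num_labels, hidden=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)             # stand-in for a pre-trained encoder
        self.encoder = nn.GRU(hidden, hidden, batch_first=True, bidirectional=True)
        self.label_embed = nn.Embedding(num_labels, 2 * hidden)   # one vector per charge label
        # Two attention layers: labels over fact tokens, then fact over label contexts.
        self.attn1 = nn.MultiheadAttention(2 * hidden, num_heads=4, batch_first=True)
        self.attn2 = nn.MultiheadAttention(2 * hidden, num_heads=4, batch_first=True)
        self.classifier = nn.Linear(4 * hidden, num_labels)

    def forward(self, token_ids):
        # Encode the fact description and pool it into a text representation.
        tokens, _ = self.encoder(self.embed(token_ids))            # (B, T, 2H)
        text_repr = tokens.mean(dim=1)                             # (B, 2H)

        # First attention layer: each charge-label embedding attends over fact tokens.
        labels = self.label_embed.weight.unsqueeze(0).expand(token_ids.size(0), -1, -1)
        label_ctx, _ = self.attn1(labels, tokens, tokens)          # (B, L, 2H)

        # Second attention layer: the pooled fact representation attends over the
        # label-aware contexts to form the label-enhanced representation.
        query = text_repr.unsqueeze(1)                             # (B, 1, 2H)
        label_repr, _ = self.attn2(query, label_ctx, label_ctx)    # (B, 1, 2H)

        # Fuse the text side and the label side, then classify into charges.
        fused = torch.cat([text_repr, label_repr.squeeze(1)], dim=-1)
        return self.classifier(fused)                              # (B, num_labels)


if __name__ == "__main__":
    model = LabelEnhancedClassifier(vocab_size=5000, num_labels=120)
    logits = model(torch.randint(0, 5000, (2, 64)))                # two toy fact descriptions
    print(F.softmax(logits, dim=-1).shape)                         # torch.Size([2, 120])
```

In the paper itself the encoder is a fine-tuned pre-trained language model and the label information comes from charge definitions rather than randomly initialized embeddings; this sketch only illustrates how a label-side representation can be attended over and concatenated with the text-side representation before classification.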
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Dan, J., Liao, X., Xu, L., Hu, W., Zhang, T. (2022). A Joint Label-Enhanced Representation Based on Pre-trained Model for Charge Prediction. In: Lu, W., Huang, S., Hong, Y., Zhou, X. (eds) Natural Language Processing and Chinese Computing. NLPCC 2022. Lecture Notes in Computer Science, vol 13551. Springer, Cham. https://doi.org/10.1007/978-3-031-17120-8_54
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-17119-2
Online ISBN: 978-3-031-17120-8