Abstract
With improvements in technology, India is growing at a fast pace, which has led to a great deal of urbanization. However, rather than falling, the crime rate has risen over the past couple of years. The general public must be informed about how safe an area is so that they can take appropriate action to protect themselves. Every day, many local crimes are reported in online news articles, but not everyone has the time to read them all. These articles contain information that can be used to determine the safety of a location. Thus, in this paper, we propose an end-to-end solution based on Natural Language Processing to inform users of the crime rate in their area. We create a model that analyzes crimes mentioned in local news articles and collects data such as location and incident type. The model uses Named Entity Recognition to extract the locations and the crimes that have occurred. To take advantage of transfer learning, we built the model on Google's BERT. It was trained on CoNLL-2003 with custom modifications and was tested on real-time data gathered from the crime reports of several online news outlets. Our model has an F1 score of 83.87% and a validation accuracy of 96%. The information collected from the internet was visualized on a heat map using the Bokeh package. For each location, we display its name, the number of crimes that have occurred there, and the most recent crime, which gives our users a quick overview.
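The per-location metrics described above (location name, number of crimes, most recent crime) can be derived from NER output roughly as follows. This is a minimal sketch and not the authors' implementation: it assumes the tagger emits BIO tags, and the label names LOC and CRIME, the function names, the sample sentences, and the rule of counting one incident per article are all illustrative assumptions.

```python
from collections import defaultdict
from datetime import date


def extract_entities(tokens, tags):
    """Group BIO-tagged tokens into (label, text) spans."""
    entities, label, span = [], None, []
    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != label
        )
        if starts_new:
            if span:
                entities.append((label, " ".join(span)))
            label, span = tag[2:], [token]
        elif tag.startswith("I-"):
            span.append(token)  # continuation of the current span
        else:  # "O" closes any open span
            if span:
                entities.append((label, " ".join(span)))
            label, span = None, []
    if span:
        entities.append((label, " ".join(span)))
    return entities


def summarize(articles):
    """Aggregate (published_date, tokens, tags) records per location.

    Assumption: each article describes one incident, represented by the
    first CRIME span, and counts once for every location it mentions.
    """
    summary = defaultdict(
        lambda: {"count": 0, "latest_date": None, "latest_crime": None}
    )
    for published, tokens, tags in articles:
        entities = extract_entities(tokens, tags)
        locations = [text for lab, text in entities if lab == "LOC"]
        crimes = [text for lab, text in entities if lab == "CRIME"]
        if not crimes:
            continue  # no crime mention, nothing to record
        for location in locations:
            entry = summary[location]
            entry["count"] += 1
            if entry["latest_date"] is None or published > entry["latest_date"]:
                entry["latest_date"] = published
                entry["latest_crime"] = crimes[0]
    return dict(summary)


# Hypothetical tagged sentences standing in for model predictions.
ARTICLES = [
    (date(2021, 3, 1),
     ["Robbery", "reported", "in", "Banjara", "Hills"],
     ["B-CRIME", "O", "O", "B-LOC", "I-LOC"]),
    (date(2021, 3, 5),
     ["Chain", "snatching", "at", "Banjara", "Hills"],
     ["B-CRIME", "I-CRIME", "O", "B-LOC", "I-LOC"]),
    (date(2021, 3, 2),
     ["Burglary", "in", "Madhapur"],
     ["B-CRIME", "O", "B-LOC"]),
]
SUMMARY = summarize(ARTICLES)
```

Each entry of SUMMARY holds the values a heat-map tooltip would need. Geocoding the location names and plotting them with Bokeh are left out, since the abstract does not describe how those steps were done.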
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Trupthi, M., Rajole, P., Prabhu, N.D. (2024). Visualizing Crime Hotspots by Analysing Online Newspaper Articles. In: Borah, M.D., Laiphrakpam, D.S., Auluck, N., Balas, V.E. (eds) Big Data, Machine Learning, and Applications. BigDML 2021. Lecture Notes in Electrical Engineering, vol 1053. Springer, Singapore. https://doi.org/10.1007/978-981-99-3481-2_8
DOI: https://doi.org/10.1007/978-981-99-3481-2_8
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-3480-5
Online ISBN: 978-981-99-3481-2
eBook Packages: Computer Science, Computer Science (R0)