Machine Learning Algorithms for Crime Prediction under Indian Penal Code

Cited by: 3
Authors
Aziz R.M. [1]
Sharma P. [1]
Hussain A. [1]
Affiliation
[1] VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, M.P., Bhopal
Keywords
Decision tree regression (DTR); Indian Penal Code (IPC); Mean absolute percentage error (MAPE); Natural language processing (NLP); Random forest regression (RFR); Support vector regression (SVR)
DOI
10.1007/s40745-022-00424-6
Abstract
In this paper, the authors propose a data-driven approach to extracting insight from Indian crime data. The approach can help police and other law enforcement bodies in India control and prevent crime region by region. After the data are pre-processed with MySQL Workbench and R, regression models are built using five algorithms: random forest regression (RFR), decision tree regression (DTR), multiple linear regression (MLR), simple linear regression (SLR), and support vector regression (SVR). Given the required inputs, these models predict counts for 28 types of Indian Penal Code (IPC) cognizable crime, as well as the total IPC cognizable crime count, region-wise, state-wise, and year-wise (for the whole country). Chord diagrams and map plots are used to visualize the pre-processed data (for the years 2014 to 2020) and the predictions made for 2022 by the best-performing regression model. For the chosen data, the RFR model predicting total IPC cognizable crime fits best among the models compared, with an adjusted R-squared value of 0.96 and a mean absolute percentage error (MAPE) of 0.2. Among the models predicting the region-wise theft count, the RFR-based model also fits best, with an adjusted R-squared value of 0.96 and a MAPE of 0.166. The models predict that the state of Andhra Pradesh will have the highest crime count, with Adilabad district at the top at 31,933 predicted crimes. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2022.
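The abstract ranks the models by two metrics, adjusted R-squared and MAPE. The following is a minimal sketch of how these are conventionally computed, not the authors' code. The function names and the sample numbers are hypothetical, and it assumes MAPE is reported as a fraction (so 0.2 would mean 20%), which the abstract does not state.

```python
from typing import Sequence


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error as a fraction (assumed convention)."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    # Average of |actual - predicted| / |actual| over all observations.
    return sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


def adjusted_r2(actual: Sequence[float], predicted: Sequence[float],
                n_features: int) -> float:
    """Adjusted R-squared: R-squared penalized for the number of predictors."""
    n = len(actual)
    if n != len(predicted):
        raise ValueError("inputs must be of equal length")
    if n - n_features - 1 <= 0:
        raise ValueError("need more observations than predictors plus one")
    mean_actual = sum(actual) / n
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    if ss_tot == 0:
        raise ValueError("R-squared is undefined for constant actual values")
    r2 = 1 - ss_res / ss_tot
    return 1 - (1 - r2) * (n - 1) / (n - n_features - 1)


if __name__ == "__main__":
    # Hypothetical crime counts, for illustration only (not from the paper).
    observed = [100.0, 200.0, 400.0]
    forecast = [110.0, 180.0, 400.0]
    print(mape(observed, forecast))
```

A lower MAPE and a higher adjusted R-squared both indicate a better fit. The adjustment term matters when models with different numbers of input features are compared, as with simple versus multiple linear regression.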
Pages: 379-410
Page count: 31