Big data analytics for identifying electricity theft using machine learning approaches in microgrids for smart communities

Cited by: 20
Authors
Arif, Arooj [1]
Javaid, Nadeem [1]
Aldegheishem, Abdulaziz [2]
Alrajeh, Nabil [3]
Affiliations
[1] COMSATS Univ Islamabad, Dept Comp Sci, Islamabad 44000, Pakistan
[2] King Saud Univ KSU, Coll Architecture & Planning, Urban Planning Dept, Riyadh, Saudi Arabia
[3] King Saud Univ KSU, Biomed Technol Dept, Coll Appl Med Sci, Riyadh, Saudi Arabia
Source
Keywords
big data; electricity theft detection; hyperactive optimization toolkit; machine learning; smart grids; urban planning; imbalanced data; optimization; systems
DOI
10.1002/cpe.6316
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Electricity theft (ET) causes major revenue loss for power utilities. It degrades the quality of supply, raises production costs, forces legitimate consumers to pay higher prices, and harms the economy as a whole. In this article, we use the State Grid Corporation of China (SGCC) dataset, which contains 1035 days of electricity consumption data labeled with two classes: normal and fraudulent. We propose an ET detection model that consists of four steps: interpolation, data balancing, feature extraction, and classification. First, missing values in the dataset are recovered using interpolation. Second, the data are resampled: ET consumers make up only 9% of the SGCC dataset, and this imbalance prevents a model from classifying both classes (normal and theft) correctly. To address it, a hybrid resampling technique is proposed that combines the synthetic minority oversampling technique (SMOTE) with near miss undersampling. Third, a residual network extracts latent features from the SGCC dataset. Fourth, three tree-based classifiers, namely decision tree (DT), random forest (RF), and adaptive boosting (AdaBoost), are trained on the encoded feature vectors for classification. In addition, searching for good hyperparameters is a challenging task that is usually done manually and takes a considerable amount of time. To resolve this problem, a Bayesian optimizer is used to simplify the tuning of DT, RF, and AdaBoost. The results indicate that RF outperforms DT and AdaBoost.
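
The first two steps of the abstract (interpolation and hybrid resampling) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not say which interpolation method or which near miss variant was used, so linear interpolation and a NearMiss-1 style rule are assumptions here, and the function names (`interpolate_missing`, `smote_like`, `near_miss`, `hybrid_resample`) are hypothetical. The feature extraction, classification, and Bayesian tuning steps are not shown.

```python
import math
import random


def interpolate_missing(series):
    """Fill None gaps linearly between the nearest observed neighbours.
    Leading and trailing gaps copy the nearest observed value (an assumption)."""
    known = [i for i, v in enumerate(series) if v is not None]
    if not known:
        raise ValueError("series has no observed values")
    filled = list(series)
    for i, v in enumerate(series):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = series[right]
        elif right is None:
            filled[i] = series[left]
        else:
            w = (i - left) / (right - left)
            filled[i] = series[left] + w * (series[right] - series[left])
    return filled


def smote_like(minority, n_new, k, rng):
    """SMOTE-style oversampling: each synthetic point lies on the segment
    between a random minority sample and one of its k nearest minority
    neighbours. Needs at least two minority samples."""
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((p for p in minority if p is not base),
                            key=lambda p: math.dist(base, p))[:k]
        other = rng.choice(neighbours)
        gap = rng.random()
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def near_miss(majority, minority, n_keep, k):
    """NearMiss-1 style undersampling: keep the majority samples whose mean
    distance to their k nearest minority samples is smallest."""
    def score(p):
        nearest = sorted(math.dist(p, m) for m in minority)[:k]
        return sum(nearest) / len(nearest)
    return sorted(majority, key=score)[:n_keep]


def hybrid_resample(majority, minority, target, k=2, seed=0):
    """Bring both classes to `target` samples: oversample the minority,
    undersample the majority."""
    rng = random.Random(seed)
    extra = smote_like(minority, max(0, target - len(minority)), k, rng)
    kept = near_miss(majority, minority, min(target, len(majority)), k)
    return kept, minority + extra


if __name__ == "__main__":
    # Toy daily-consumption series with missing readings.
    print(interpolate_missing([None, 2.0, None, None, 5.0, None]))
    # Toy imbalanced data: 20 "normal" rows, 3 "theft" rows.
    normal = [[float(i), float(i % 3)] for i in range(20)]
    theft = [[100.0, 0.0], [101.0, 1.0], [102.0, 0.0]]
    kept, boosted = hybrid_resample(normal, theft, target=8)
    print(len(kept), len(boosted))
```

On real data, the `SMOTE` and `NearMiss` classes of the imbalanced-learn package would normally replace these hand-written helpers; the sketch only makes the mechanics of the two resampling directions visible.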
Pages: 21
Related papers
50 in total
  • [1] Big Data Analytics for Electricity Theft Detection in Smart Grids
    Khan, Inam Ullah
    Javaid, Nadeem
    Taylor, C. James
    Gamage, Kelum A. A.
    Ma, Xiandong
    2021 IEEE MADRID POWERTECH, 2021
  • [2] Towards Efficient Energy Utilization Using Big Data Analytics in Smart Cities for Electricity Theft Detection
    Arif, Arooj
    Alghamdi, Turki Ali
    Khan, Zahoor Ali
    Javaid, Nadeem
    BIG DATA RESEARCH, 2022, 27
  • [3] Electricity theft detection in smart grid using machine learning
    Iftikhar, Hasnain
    Khan, Nitasha
    Raza, Muhammad Amir
    Abbas, Ghulam
    Khan, Murad
    Aoudia, Mouloud
    Touti, Ezzeddine
    Emara, Ahmed
    FRONTIERS IN ENERGY RESEARCH, 2024, 12
  • [4] Detection of electricity theft in low voltage networks using analytics and machine learning
    Hashatsi, Mabatho
    Maulu, Chizeba
    Shuma-Iwisi, Mercy
    2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 322 - 327
  • [5] Machine Learning approaches on Map Reduce for Big Data Analytics
    Lakshmi, J. V. N.
    Sheshasaayee, Ananthi
    2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 480 - 484
  • [6] Renewable energy management in smart grids by using big data analytics and machine learning
    Mostafa, Noha
    Ramadan, Haitham Saad Mohamed
    Elfarouk, Omar
    MACHINE LEARNING WITH APPLICATIONS, 2022, 9
  • [7] Renewable energy management in smart grids by using big data analytics and machine learning
    Mostafa, Noha
    Ramadan, Haitham Saad Mohamed
    Elfarouk, Omar
    Machine Learning with Applications, 2022, 9
  • [8] Big Data Analytics using Machine Learning Techniques
    Mittal, Shweta
    Sangwan, Om Prakash
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 203 - 207
  • [9] Electricity Theft Detection Using Machine Learning Techniques to Secure Smart Grid
    Adil, Muhammad
    Javaid, Nadeem
    Ullah, Zia
    Maqsood, Mahad
    Ali, Salman
    Daud, Muhammad Awais
    COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, 2021, 1194 : 233 - 243
  • [10] Machine learning for big data analytics
    Oja, E. (erkki.oja@aalto.fi), 1600, Springer Verlag (384):