A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders

被引:0
|
作者
Tufail, Shahid [1 ]
Iqbal, Hasan [1 ]
Tariq, Mohd [1 ]
Sarwat, Arif I. [1 ]
机构
[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33174 USA
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Smart grids; Principal component analysis; Accuracy; Autoencoders; Random forests; Data models; Machine learning algorithms; Dimensionality reduction; Computer security; Support vector machines; Photovoltaic (PV) systems; grid-connected PV systems; machine learning algorithms; random forest; autoencoders; multi-layer perceptron (MLP); principal component analysis (PCA); INTRUSION DETECTION; CYBER-SECURITY;
D O I
10.1109/ACCESS.2025.3543751
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyberattacks, especially data injection attacks, are becoming more common as smart grids are increasingly interconnected. In addition, accurate and unbiased high-quality data is required for model training. Most of the data we collect from the real world is sparse, incomplete, inconsistent, and skewed. To address these issues, we have proposed a framework to detect such attacks in this study. Using a stacked autoencoder architecture, synthetic instances of minority class data were generated. The generated classes address the imbalances in the data to enhance the generalizability of the model and address diverse attack scenarios. Various machine learning algorithms were evaluated, and the Random Forest (RF) model consistently achieved superior accuracy, ranging from 99.32% to 95.89%. In particular, traditional algorithms such as Logistic Regression (LR) exhibited sensitivity to dimensionality reductions, experiencing a 16.96% accuracy drop when the principal components were reduced from all to 10. In contrast, RF demonstrated resilience, with only a 1.67% mean accuracy drop under similar conditions. Both RF and XGBoost (XGB) emerged as standout models, showcasing high accuracy and robust performance even with dimensionality reduction via principal component analysis (PCA). However, reducing PCA components from 10 to 5 led to performance decreases in all models. The Support Vector Machine (SVM) Classifier shows the highest accuracy drop of 14.21%. This study shows the importance of understanding algorithmic behavior and data features and how it can impact the performance of ML models. This analysis will strengthen cybersecurity in smart grids and focusing on the critical need for careful feature selection and tuning, particularly for models sensitive to dimensionality reduction.
引用
收藏
页码:33783 / 33798
页数:16
相关论文
共 50 条
  • [41] False Data Injection Attack Detection for Secure Distributed Demand Response in Smart Grids
    Dayaratne, Thusitha
    Salehi, Mahsa
    Rudolph, Carsten
    Liebman, Ariel
    2022 52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2022), 2022, : 367 - 380
  • [42] Theft Cyberattacks Detection in Smart Grids Based on Machine Learning
    Ali, Abdelfatah
    Mokhtar, Mohamed
    Shaaban, Mostafa F.
    2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [43] A Semantic Learning-Based SQL Injection Attack Detection Technology
    Lu, Dongzhe
    Fei, Jinlong
    Liu, Long
    ELECTRONICS, 2023, 12 (06)
  • [44] A Machine Learning-Based Attack Detection and Prevention System in Vehicular Named Data Networking
    Magsi, Arif Hussain
    Ghulam, Ali
    Memon, Saifullah
    Javeed, Khalid
    Alhussein, Musaed
    Rida, Imad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 1445 - 1465
  • [45] A machine learning-based Anomaly Detection Framework for building electricity consumption data
    Mascali, Lorenzo
    Schiera, Daniele Salvatore
    Eiraudo, Simone
    Barbierato, Luca
    Giannantonio, Roberta
    Patti, Edoardo
    Bottaccioli, Lorenzo
    Lanzini, Andrea
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2023, 36
  • [46] Distributed Anomaly Detection in Smart Grids: A Federated Learning-Based Approach
    Jithish, J.
    Alangot, Bithin
    Mahalingam, Nagarajan
    Yeo, Kiat Seng
    IEEE ACCESS, 2023, 11 : 7157 - 7179
  • [47] Fourier Singular Values-Based False Data Injection Attack Detection in AC Smart-Grids
    Dehghani, Moslem
    Niknam, Taher
    Ghiasi, Mohammad
    Siano, Pierluigi
    Alhelou, Hassan Haes
    Al-Hinai, Amer
    APPLIED SCIENCES-BASEL, 2021, 11 (12):
  • [48] Detection Method for Tolerable False Data Injection Attack Based on Deep Learning Framework
    He, Sizhe
    Zhou, Yadong
    Lv, Xiaoliang
    Chen, Wei
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6717 - 6721
  • [49] Electricity Theft Detection in a Smart Grid Using Hybrid Deep Learning-Based Data Analysis Technique
    Mbey, Camille Franklin
    Bikai, Jacques
    Souhe, Felix Ghislain Yem
    Kakeu, Vinny Junior Foba
    Boum, Alexandre Teplaira
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2024, 2024
  • [50] Blind False Data Injection Attack Using PCA Approximation Method in Smart Grid
    Yu, Zong-Han
    Chin, Wen-Long
    IEEE TRANSACTIONS ON SMART GRID, 2015, 6 (03) : 1219 - 1226