A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders

被引:0
|
作者
Tufail, Shahid [1 ]
Iqbal, Hasan [1 ]
Tariq, Mohd [1 ]
Sarwat, Arif I. [1 ]
机构
[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33174 USA
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Smart grids; Principal component analysis; Accuracy; Autoencoders; Random forests; Data models; Machine learning algorithms; Dimensionality reduction; Computer security; Support vector machines; Photovoltaic (PV) systems; grid-connected PV systems; machine learning algorithms; random forest; autoencoders; multi-layer perceptron (MLP); principal component analysis (PCA); INTRUSION DETECTION; CYBER-SECURITY;
D O I
10.1109/ACCESS.2025.3543751
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyberattacks, especially data injection attacks, are becoming more common as smart grids are increasingly interconnected. In addition, accurate and unbiased high-quality data is required for model training. Most of the data we collect from the real world is sparse, incomplete, inconsistent, and skewed. To address these issues, we have proposed a framework to detect such attacks in this study. Using a stacked autoencoder architecture, synthetic instances of minority class data were generated. The generated classes address the imbalances in the data to enhance the generalizability of the model and address diverse attack scenarios. Various machine learning algorithms were evaluated, and the Random Forest (RF) model consistently achieved superior accuracy, ranging from 99.32% to 95.89%. In particular, traditional algorithms such as Logistic Regression (LR) exhibited sensitivity to dimensionality reductions, experiencing a 16.96% accuracy drop when the principal components were reduced from all to 10. In contrast, RF demonstrated resilience, with only a 1.67% mean accuracy drop under similar conditions. Both RF and XGBoost (XGB) emerged as standout models, showcasing high accuracy and robust performance even with dimensionality reduction via principal component analysis (PCA). However, reducing PCA components from 10 to 5 led to performance decreases in all models. The Support Vector Machine (SVM) Classifier shows the highest accuracy drop of 14.21%. This study shows the importance of understanding algorithmic behavior and data features and how it can impact the performance of ML models. This analysis will strengthen cybersecurity in smart grids and focusing on the critical need for careful feature selection and tuning, particularly for models sensitive to dimensionality reduction.
引用
收藏
页码:33783 / 33798
页数:16
相关论文
共 50 条
  • [31] Leveraging High-Fidelity Datasets for Machine Learning-based Anomaly Detection in Smart Grids
    Hyder, Burhan
    Ahmed, Arman
    Mana, Priya
    Edgar, Thomas
    Niddodi, Shwetha
    2023 11TH WORKSHOP ON MODELLING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2023,
  • [32] Machine Learning-Based Attack Detection for the Internet of Things
    Bikila, Dawit Dejene
    Capek, Jan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
  • [33] Machine Learning-Based Attack Detection Method in Hadoop
    Li, Ningwei
    Gao, Hang
    Liu, Liang
    Peng, Jianfei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT III, 2020, 12454 : 184 - 196
  • [34] Interval Observer-Based Detection and Localization Against False Data Injection Attack in Smart Grids
    Luo, Xiaoyuan
    Li, Yating
    Wang, Xinyu
    Guan, Xinping
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (02) : 657 - 671
  • [35] Adversarial Attack Detection in Smart Grids Using Deep Learning Architectures
    Ness, Stephanie
    IEEE ACCESS, 2025, 13 : 16314 - 16323
  • [36] Stacked autoencoders and extreme learning machine based hybrid model for electrical load prediction
    Peng, Wei
    Xu, Liwen
    Li, Chengdong
    Xie, Xiuying
    Zhang, Guiqing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5403 - 5416
  • [37] Dynamic Detection of False Data Injection Attack in Smart Grid using Deep Learning
    Niu, Xiangyu
    Li, Jiangnan
    Sun, Jinyuan
    Tomsovic, Kevin
    2019 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT), 2019,
  • [38] Review of deep learning-based false data injection attack detection in power systems
    Li, Zhuo
    Xie, Yaobin
    Wu, Qianqiong
    Zhang, Youwei
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2024, 52 (19): : 175 - 187
  • [39] A novel detection and defense mechanism against false data injection attack in smart grids
    Cui, Jinlong
    Gao, Beibei
    Guo, Baojun
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2023, 17 (20) : 4514 - 4524
  • [40] Quickest Detection of False Data Injection Attack in Wide-Area Smart Grids
    Li, Shang
    Yilmaz, Yasin
    Wang, Xiaodong
    IEEE TRANSACTIONS ON SMART GRID, 2015, 6 (06) : 2725 - 2735