A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders

被引:0
|
作者
Tufail, Shahid [1 ]
Iqbal, Hasan [1 ]
Tariq, Mohd [1 ]
Sarwat, Arif I. [1 ]
机构
[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33174 USA
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Smart grids; Principal component analysis; Accuracy; Autoencoders; Random forests; Data models; Machine learning algorithms; Dimensionality reduction; Computer security; Support vector machines; Photovoltaic (PV) systems; grid-connected PV systems; machine learning algorithms; random forest; autoencoders; multi-layer perceptron (MLP); principal component analysis (PCA); INTRUSION DETECTION; CYBER-SECURITY;
D O I
10.1109/ACCESS.2025.3543751
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyberattacks, especially data injection attacks, are becoming more common as smart grids are increasingly interconnected. In addition, accurate and unbiased high-quality data is required for model training. Most of the data we collect from the real world is sparse, incomplete, inconsistent, and skewed. To address these issues, we have proposed a framework to detect such attacks in this study. Using a stacked autoencoder architecture, synthetic instances of minority class data were generated. The generated classes address the imbalances in the data to enhance the generalizability of the model and address diverse attack scenarios. Various machine learning algorithms were evaluated, and the Random Forest (RF) model consistently achieved superior accuracy, ranging from 99.32% to 95.89%. In particular, traditional algorithms such as Logistic Regression (LR) exhibited sensitivity to dimensionality reductions, experiencing a 16.96% accuracy drop when the principal components were reduced from all to 10. In contrast, RF demonstrated resilience, with only a 1.67% mean accuracy drop under similar conditions. Both RF and XGBoost (XGB) emerged as standout models, showcasing high accuracy and robust performance even with dimensionality reduction via principal component analysis (PCA). However, reducing PCA components from 10 to 5 led to performance decreases in all models. The Support Vector Machine (SVM) Classifier shows the highest accuracy drop of 14.21%. This study shows the importance of understanding algorithmic behavior and data features and how it can impact the performance of ML models. This analysis will strengthen cybersecurity in smart grids and focusing on the critical need for careful feature selection and tuning, particularly for models sensitive to dimensionality reduction.
引用
收藏
页码:33783 / 33798
页数:16
相关论文
共 50 条
  • [21] A locational false data injection attack detection method in smart grid based on adversarial variational autoencoders
    Wang, Yufeng
    Zhou, Yangming
    Ma, Jianhua
    Jin, Qun
    APPLIED SOFT COMPUTING, 2024, 151
  • [22] A machine learning-based detection framework against intermittent electricity theft attack
    Fang, Hongliang
    Xiao, Jiang-Wen
    Wang, Yan-Wu
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 150
  • [23] Cluster partition-fuzzy broad learning-based fast detection and localization framework for false data injection attack in smart distribution networks
    An, Haopeng
    Xing, Yankai
    Zhang, Guangdou
    Bamisile, Olusola
    Li, Jian
    Huang, Qi
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2024, 40
  • [24] Network Intrusion Detection in Smart Grids for Imbalanced Attack Types Using Machine Learning Models
    Das Roy, Dipanjan
    Shin, Dongwan
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 576 - 581
  • [25] Anomaly Detection in Smart Grids using Machine Learning
    Shabad, Prem Kumar Reddy
    Alrashide, Abdulmueen
    Mohammed, Osama
    IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
  • [26] Machine Learning-Based Intrusion Detection for Achieving Cybersecurity in Smart Grids Using IEC 61850 GOOSE Messages
    Ustun, Taha Selim
    Hussain, S. M. Suhail
    Ulutas, Ahsen
    Onen, Ahmet
    Roomi, Muhammad M.
    Mashima, Daisuke
    SYMMETRY-BASEL, 2021, 13 (05):
  • [27] Detection of False Data Injection Attack in Smart Grids via Interval Observer
    Wang, Xinyu
    Luo, Xiaoyuan
    Zhang, Mingyue
    Jiang, Zhongping
    Guan, Xinping
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3238 - 3243
  • [28] Locational Detection of False Data Injection Attack in Smart Grid Based on Multilabel Machine Learning Classification Methods
    Zia, Muhammad Fahad
    Inayat, Usman
    Noor, Wafa
    Pangracious, Vinod
    Benbouzid, Mohamed
    2023 IEEE IAS GLOBAL CONFERENCE ON RENEWABLE ENERGY AND HYDROGEN TECHNOLOGIES, GLOBCONHT, 2023,
  • [29] Deep learning for online AC False Data Injection Attack detection in smart grids: An approach using LSTM-Autoencoder
    Yang, Liqun
    Zhai, You
    Li, Zhoujun
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 193 (193)
  • [30] Detecting False Data Injection Attacks Using Machine Learning-Based Approaches for Smart Grid Networks
    Abudin, M. D. Jainul
    Thokchom, Surmila
    Naayagi, R. T.
    Panda, Gayadhar
    APPLIED SCIENCES-BASEL, 2024, 14 (11):