A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders

Cited by: 0

Authors:

Tufail, Shahid [1]

Iqbal, Hasan [1]

Tariq, Mohd [1]

Sarwat, Arif I. [1]

Affiliations:

[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33174 USA

Source:

IEEE ACCESS | 2025 / Vol. 13

Keywords:

Smart grids; Principal component analysis; Accuracy; Autoencoders; Random forests; Data models; Machine learning algorithms; Dimensionality reduction; Computer security; Support vector machines; Photovoltaic (PV) systems; grid-connected PV systems; machine learning algorithms; random forest; autoencoders; multi-layer perceptron (MLP); principal component analysis (PCA); INTRUSION DETECTION; CYBER-SECURITY

DOI:

10.1109/ACCESS.2025.3543751

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Cyberattacks, especially data injection attacks, are becoming more common as smart grids grow increasingly interconnected. Training detection models requires accurate, unbiased, high-quality data, yet most data collected from the real world is sparse, incomplete, inconsistent, and skewed. To address these issues, this study proposes a framework for detecting such attacks. A stacked autoencoder architecture was used to generate synthetic instances of minority-class data. The generated instances address the imbalance in the data, enhancing the generalizability of the model and covering diverse attack scenarios. Various machine learning algorithms were evaluated, and the Random Forest (RF) model consistently achieved superior accuracy, ranging from 99.32% to 95.89%. In particular, traditional algorithms such as Logistic Regression (LR) were sensitive to dimensionality reduction, with accuracy dropping by 16.96% when the number of principal components was reduced from all to 10. In contrast, RF proved resilient, with a mean accuracy drop of only 1.67% under similar conditions. Both RF and XGBoost (XGB) emerged as standout models, maintaining high accuracy and robust performance even with dimensionality reduction via principal component analysis (PCA). However, reducing the number of PCA components from 10 to 5 degraded the performance of all models, with the Support Vector Machine (SVM) classifier showing the largest accuracy drop, 14.21%. This study shows the importance of understanding algorithmic behavior and data features and how they affect the performance of ML models. The analysis aims to strengthen cybersecurity in smart grids and highlights the critical need for careful feature selection and tuning, particularly for models sensitive to dimensionality reduction.
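The dimensionality-reduction step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a standard-library-only PCA (power iteration with deflation) applied to synthetic toy data, and every name in it (`center`, `covariance`, `top_components`, `pca_project`, `toy`) is hypothetical. It only shows how keeping fewer principal components discards part of the variance, which is the effect the abstract reports some classifiers being sensitive to. In practice a numerical library would normally be used instead.

```python
import math
import random


def center(rows):
    """Subtract the per-column mean from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of rows that are already centered."""
    n, d = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(d)]
            for i in range(d)]


def top_components(cov, k, iters=500):
    """Leading k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    d = len(cov)
    cov = [row[:] for row in cov]  # work on a copy; deflation mutates it
    rng = random.Random(0)
    pairs = []
    for _ in range(k):
        v = [rng.gauss(0, 1) for _ in range(d)]
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0:
                break  # no variance left in any direction
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue estimate for unit vector v.
        eigval = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d))
                     for i in range(d))
        pairs.append((eigval, v))
        # Deflate: remove the component just found from the matrix.
        for i in range(d):
            for j in range(d):
                cov[i][j] -= eigval * v[i] * v[j]
    return pairs


def pca_project(rows, k):
    """Project rows onto the top-k principal components.

    Returns (projected_rows, explained_variance_ratios).
    """
    centered = center(rows)
    cov = covariance(centered)
    total_var = sum(cov[i][i] for i in range(len(cov)))
    pairs = top_components(cov, k)
    projected = [[sum(r[j] * v[j] for j in range(len(r))) for _, v in pairs]
                 for r in centered]
    ratios = [val / total_var for val, _ in pairs]
    return projected, ratios


# Synthetic toy data (an assumption for illustration, not from the paper):
# feature 1 is nearly a multiple of feature 0, feature 2 is small noise.
rng = random.Random(42)
toy = []
for _ in range(200):
    t = rng.gauss(0, 1)
    toy.append([t, 2 * t + rng.gauss(0, 0.1), rng.gauss(0, 0.1)])

projected, ratios = pca_project(toy, k=2)
print("explained variance ratios:", ratios)
```

Comparing the sum of `ratios` for different values of `k` gives a rough sense of how much information a classifier loses when the component count is reduced, for example from 10 to 5 as in the study.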


Pages: 33783 - 33798

Page count: 16

50 entries in total

[31] Leveraging High-Fidelity Datasets for Machine Learning-based Anomaly Detection in Smart Grids
Hyder, Burhan
Ahmed, Arman
Mana, Priya
Edgar, Thomas
Niddodi, Shwetha
2023 11TH WORKSHOP ON MODELLING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2023,
[32] Machine Learning-Based Attack Detection for the Internet of Things
Bikila, Dawit Dejene
Capek, Jan
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
[33] Machine Learning-Based Attack Detection Method in Hadoop
Li, Ningwei
Gao, Hang
Liu, Liang
Peng, Jianfei
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT III, 2020, 12454 : 184 - 196
[34] Interval Observer-Based Detection and Localization Against False Data Injection Attack in Smart Grids
Luo, Xiaoyuan
Li, Yating
Wang, Xinyu
Guan, Xinping
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (02) : 657 - 671
[35] Adversarial Attack Detection in Smart Grids Using Deep Learning Architectures
Ness, Stephanie
IEEE ACCESS, 2025, 13 : 16314 - 16323
[36] Stacked autoencoders and extreme learning machine based hybrid model for electrical load prediction
Peng, Wei
Xu, Liwen
Li, Chengdong
Xie, Xiuying
Zhang, Guiqing
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5403 - 5416
[37] Dynamic Detection of False Data Injection Attack in Smart Grid using Deep Learning
Niu, Xiangyu
Li, Jiangnan
Sun, Jinyuan
Tomsovic, Kevin
2019 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT), 2019,
[38] Review of deep learning-based false data injection attack detection in power systems
Li, Zhuo
Xie, Yaobin
Wu, Qianqiong
Zhang, Youwei
Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2024, 52 (19) : 175 - 187
[39] A novel detection and defense mechanism against false data injection attack in smart grids
Cui, Jinlong
Gao, Beibei
Guo, Baojun
IET GENERATION TRANSMISSION & DISTRIBUTION, 2023, 17 (20) : 4514 - 4524
[40] Quickest Detection of False Data Injection Attack in Wide-Area Smart Grids
Li, Shang
Yilmaz, Yasin
Wang, Xiaodong
IEEE TRANSACTIONS ON SMART GRID, 2015, 6 (06) : 2725 - 2735
