A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders

被引：0

作者：

Tufail, Shahid ^{[1
]}

Iqbal, Hasan ^{[1
]}

Tariq, Mohd ^{[1
]}

Sarwat, Arif I. ^{[1
]}

机构：

[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33174 USA

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Smart grids; Principal component analysis; Accuracy; Autoencoders; Random forests; Data models; Machine learning algorithms; Dimensionality reduction; Computer security; Support vector machines; Photovoltaic (PV) systems; grid-connected PV systems; machine learning algorithms; random forest; autoencoders; multi-layer perceptron (MLP); principal component analysis (PCA); INTRUSION DETECTION; CYBER-SECURITY;

D O I：

10.1109/ACCESS.2025.3543751

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cyberattacks, especially data injection attacks, are becoming more common as smart grids are increasingly interconnected. In addition, accurate and unbiased high-quality data is required for model training. Most of the data we collect from the real world is sparse, incomplete, inconsistent, and skewed. To address these issues, we have proposed a framework to detect such attacks in this study. Using a stacked autoencoder architecture, synthetic instances of minority class data were generated. The generated classes address the imbalances in the data to enhance the generalizability of the model and address diverse attack scenarios. Various machine learning algorithms were evaluated, and the Random Forest (RF) model consistently achieved superior accuracy, ranging from 99.32% to 95.89%. In particular, traditional algorithms such as Logistic Regression (LR) exhibited sensitivity to dimensionality reductions, experiencing a 16.96% accuracy drop when the principal components were reduced from all to 10. In contrast, RF demonstrated resilience, with only a 1.67% mean accuracy drop under similar conditions. Both RF and XGBoost (XGB) emerged as standout models, showcasing high accuracy and robust performance even with dimensionality reduction via principal component analysis (PCA). However, reducing PCA components from 10 to 5 led to performance decreases in all models. The Support Vector Machine (SVM) Classifier shows the highest accuracy drop of 14.21%. This study shows the importance of understanding algorithmic behavior and data features and how it can impact the performance of ML models. This analysis will strengthen cybersecurity in smart grids and focusing on the critical need for careful feature selection and tuning, particularly for models sensitive to dimensionality reduction.

引用

页码：33783 / 33798

页数：16

共 50 条

[21] A locational false data injection attack detection method in smart grid based on adversarial variational autoencoders
Wang, Yufeng
Zhou, Yangming
Ma, Jianhua
Jin, Qun
APPLIED SOFT COMPUTING, 2024, 151
[22] A machine learning-based detection framework against intermittent electricity theft attack
Fang, Hongliang
Xiao, Jiang-Wen
Wang, Yan-Wu
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 150
[23] Cluster partition-fuzzy broad learning-based fast detection and localization framework for false data injection attack in smart distribution networks
An, Haopeng
Xing, Yankai
Zhang, Guangdou
Bamisile, Olusola
Li, Jian
Huang, Qi
SUSTAINABLE ENERGY GRIDS & NETWORKS, 2024, 40
[24] Network Intrusion Detection in Smart Grids for Imbalanced Attack Types Using Machine Learning Models
Das Roy, Dipanjan
Shin, Dongwan
2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 576 - 581
[25] Anomaly Detection in Smart Grids using Machine Learning
Shabad, Prem Kumar Reddy
Alrashide, Abdulmueen
Mohammed, Osama
IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
[26] Machine Learning-Based Intrusion Detection for Achieving Cybersecurity in Smart Grids Using IEC 61850 GOOSE Messages
Ustun, Taha Selim
Hussain, S. M. Suhail
Ulutas, Ahsen
Onen, Ahmet
Roomi, Muhammad M.
Mashima, Daisuke
SYMMETRY-BASEL, 2021, 13 (05):
[27] Detection of False Data Injection Attack in Smart Grids via Interval Observer
Wang, Xinyu
Luo, Xiaoyuan
Zhang, Mingyue
Jiang, Zhongping
Guan, Xinping
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3238 - 3243
[28] Locational Detection of False Data Injection Attack in Smart Grid Based on Multilabel Machine Learning Classification Methods
Zia, Muhammad Fahad
Inayat, Usman
Noor, Wafa
Pangracious, Vinod
Benbouzid, Mohamed
2023 IEEE IAS GLOBAL CONFERENCE ON RENEWABLE ENERGY AND HYDROGEN TECHNOLOGIES, GLOBCONHT, 2023,
[29] Deep learning for online AC False Data Injection Attack detection in smart grids: An approach using LSTM-Autoencoder
Yang, Liqun
Zhai, You
Li, Zhoujun
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 193 (193)
[30] Detecting False Data Injection Attacks Using Machine Learning-Based Approaches for Smart Grid Networks
Abudin, M. D. Jainul
Thokchom, Surmila
Naayagi, R. T.
Panda, Gayadhar
APPLIED SCIENCES-BASEL, 2024, 14 (11):

← 1 2 3 4 5 →