A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders

被引：0

作者：

Tufail, Shahid ^{[1
]}

Iqbal, Hasan ^{[1
]}

Tariq, Mohd ^{[1
]}

Sarwat, Arif I. ^{[1
]}

机构：

[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33174 USA

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Smart grids; Principal component analysis; Accuracy; Autoencoders; Random forests; Data models; Machine learning algorithms; Dimensionality reduction; Computer security; Support vector machines; Photovoltaic (PV) systems; grid-connected PV systems; machine learning algorithms; random forest; autoencoders; multi-layer perceptron (MLP); principal component analysis (PCA); INTRUSION DETECTION; CYBER-SECURITY;

D O I：

10.1109/ACCESS.2025.3543751

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cyberattacks, especially data injection attacks, are becoming more common as smart grids are increasingly interconnected. In addition, accurate and unbiased high-quality data is required for model training. Most of the data we collect from the real world is sparse, incomplete, inconsistent, and skewed. To address these issues, we have proposed a framework to detect such attacks in this study. Using a stacked autoencoder architecture, synthetic instances of minority class data were generated. The generated classes address the imbalances in the data to enhance the generalizability of the model and address diverse attack scenarios. Various machine learning algorithms were evaluated, and the Random Forest (RF) model consistently achieved superior accuracy, ranging from 99.32% to 95.89%. In particular, traditional algorithms such as Logistic Regression (LR) exhibited sensitivity to dimensionality reductions, experiencing a 16.96% accuracy drop when the principal components were reduced from all to 10. In contrast, RF demonstrated resilience, with only a 1.67% mean accuracy drop under similar conditions. Both RF and XGBoost (XGB) emerged as standout models, showcasing high accuracy and robust performance even with dimensionality reduction via principal component analysis (PCA). However, reducing PCA components from 10 to 5 led to performance decreases in all models. The Support Vector Machine (SVM) Classifier shows the highest accuracy drop of 14.21%. This study shows the importance of understanding algorithmic behavior and data features and how it can impact the performance of ML models. This analysis will strengthen cybersecurity in smart grids and focusing on the critical need for careful feature selection and tuning, particularly for models sensitive to dimensionality reduction.

引用

页码：33783 / 33798

页数：16

共 50 条

[41] False Data Injection Attack Detection for Secure Distributed Demand Response in Smart Grids
Dayaratne, Thusitha
Salehi, Mahsa
Rudolph, Carsten
Liebman, Ariel
2022 52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2022), 2022, : 367 - 380
[42] Theft Cyberattacks Detection in Smart Grids Based on Machine Learning
Ali, Abdelfatah
Mokhtar, Mohamed
Shaaban, Mostafa F.
2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
[43] A Semantic Learning-Based SQL Injection Attack Detection Technology
Lu, Dongzhe
Fei, Jinlong
Liu, Long
ELECTRONICS, 2023, 12 (06)
[44] A Machine Learning-Based Attack Detection and Prevention System in Vehicular Named Data Networking
Magsi, Arif Hussain
Ghulam, Ali
Memon, Saifullah
Javeed, Khalid
Alhussein, Musaed
Rida, Imad
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 1445 - 1465
[45] A machine learning-based Anomaly Detection Framework for building electricity consumption data
Mascali, Lorenzo
Schiera, Daniele Salvatore
Eiraudo, Simone
Barbierato, Luca
Giannantonio, Roberta
Patti, Edoardo
Bottaccioli, Lorenzo
Lanzini, Andrea
SUSTAINABLE ENERGY GRIDS & NETWORKS, 2023, 36
[46] Distributed Anomaly Detection in Smart Grids: A Federated Learning-Based Approach
Jithish, J.
Alangot, Bithin
Mahalingam, Nagarajan
Yeo, Kiat Seng
IEEE ACCESS, 2023, 11 : 7157 - 7179
[47] Fourier Singular Values-Based False Data Injection Attack Detection in AC Smart-Grids
Dehghani, Moslem
Niknam, Taher
Ghiasi, Mohammad
Siano, Pierluigi
Alhelou, Hassan Haes
Al-Hinai, Amer
APPLIED SCIENCES-BASEL, 2021, 11 (12):
[48] Detection Method for Tolerable False Data Injection Attack Based on Deep Learning Framework
He, Sizhe
Zhou, Yadong
Lv, Xiaoliang
Chen, Wei
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6717 - 6721
[49] Electricity Theft Detection in a Smart Grid Using Hybrid Deep Learning-Based Data Analysis Technique
Mbey, Camille Franklin
Bikai, Jacques
Souhe, Felix Ghislain Yem
Kakeu, Vinny Junior Foba
Boum, Alexandre Teplaira
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2024, 2024
[50] Blind False Data Injection Attack Using PCA Approximation Method in Smart Grid
Yu, Zong-Han
Chin, Wen-Long
IEEE TRANSACTIONS ON SMART GRID, 2015, 6 (03) : 1219 - 1226

← 1 2 3 4 5 →