Data Augmentation and Application of Defect Texts for Power Equipment Based on Knowledge Integration Manifold

被引:0
|
作者
Wang X. [1 ]
Gu Y. [1 ]
Zhang H. [1 ]
Liu L. [2 ]
Liu H. [1 ]
Li Q. [1 ]
机构
[1] Shandong Provincial Key Laboratory of UHV Transmission Technology and Equipment, Shandong University, Shandong Province, Jinan
[2] State Grid Shandong Electric Power Company Laiwu Power Supply Company, Shandong Province, Jinan
来源
关键词
data augmentation; data filtering; knowledge integration; power equipment defect text;
D O I
10.13335/j.1000-3673.pst.2023.0713
中图分类号
学科分类号
摘要
With the digital transformation and upgrade of the power grids, the intelligent operation and maintenance technology of the power equipment has developed rapidly. During the operation and maintenance process, a large number of defect texts containing important information of the power grids have been accumulated. Due to the sparseness of text data labels, as well as the fuzziness and diversity of the literal descriptions, it is difficult to effectively mine the operation and maintenance information in power texts. A data augmentation of the defect texts for the power equipment is proposed. Firstly, the defect text data sets are used to fine-tune the pre-training model ERNIE(enhanced representation through knowledge integration)with the multi-stage knowledge mask strategy, integrating electrical expertise into dynamic encoding of defect texts. Secondly, on the basis of manifold assumption, the destruction and reconstruction functions are designed based on the denoising autoencoder. The destruction function is constructed according to the mask unit selection strategy based on the information value, and the reconstruction function is constructed based on the fine-tuned ERNIE. The enhanced samples are obtained during the process of the destruction and reconstruction. Then, the augmented data is selected based on the influence function and the diversity measures, filtering out the samples with poor data quality and high repetition. Finally, the augmented data is applied to various text mining tasks through a multi-layer training framework. Results show that the algorithm is able to greatly improve the effect of the defect text mining, and can be widely and flexibly applied in a variety of power equipment defect text mining tasks. © 2024 Power System Technology Press. All rights reserved.
引用
收藏
页码:1690 / 1699
页数:9
相关论文
共 31 条
  • [1] SHAO Guanyu, WANG Huifang,HE Benteng, Quality assessment and improvement method for power grid equipment defect text[J], Power System Technology, 43, 4, pp. 1472-1479, (2019)
  • [2] TIAN Jiapeng, SONG Hui, CHEN Lifan, Entity recognition approach of equipment failure text for knowledge graph construction [J], Power System Technology, 46, 10, pp. 3913-3922, (2022)
  • [3] QIAO Ji, WANG Xinying, Framework and key technologies of knowledge-graph-based fault handling system in power grid[J], Proceedings of the CSEE, 40, 18, pp. 5837-5848, (2020)
  • [4] SHAO Guanyu, WANG Huifang, WU Xianghong, Precise information identification method of power equipment defect text based on dependency parsing[J], Automation of Electric Power Systems, 44, 12, pp. 178-185, (2020)
  • [5] YE Xinzhi, SHANG Lei, DONG Xuzhu, Knowledge graph for distribution network fault handling[J], Power System Technology, 46, 10, pp. 3739-3748, (2022)
  • [6] Chen JIANG, WANG Yuan, HU Junhua, Power entity information recognition based on deep learning[J], Power System Technology, 45, 6, pp. 2141-2149, (2021)
  • [7] GUO Rong, YANG Qun, LIU Shaohan, Construction and application of power grid fault handing knowledge graph[J], Power System Technology, 45, 6, pp. 2092-2100, (2021)
  • [8] WANG Jundong, YANG Jun,PEI Yangzhou, Distribution network fault assistant decision-making based on knowledge graph[J], Power System Technology, 45, 6, pp. 2101-2112, (2021)
  • [9] Qing XIE, Yang CAI, XIE Jun, Research on construction method and application of knowledge graph for power transformer operation and maintenance based on ALBERT[J], Transactions of China Electrotechnical Society, 38, 1, pp. 95-106, (2023)
  • [10] WANG Kai, ZHAO Gang, GONG Xiaocheng, Semi-supervised entity recognition method for power dispatching knowledge modeling [J], Power System Technology, 47, 9, pp. 3855-3864, (2023)