Improved graph node embedding and clustering method for fault short text

被引:0
|
作者
Qiu J. [1 ,2 ]
Sun L. [1 ,2 ]
Han M. [1 ,2 ]
机构
[1] School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu
[2] Mfg. Indust. Chains Collaboration and Info. Support Technology Key Laboratory of Sichuan Province, Chengdu
关键词
fault short text; graph node clustering; graph node embedding; local density;
D O I
10.13196/j.cims.2022.1045
中图分类号
学科分类号
摘要
To effectively mine the cross-text vocabulary association in fault short text, the global feature representation of fault entity nodes was constructed, and the fault entity node clustering label was obtained. An improved graph node embedding and clustering method for fault short text was proposed. In this method, the calculation method of edge weight was innovated in the process of graph construction to distinguish the association between words with different distances under the same window. The graph node structure feature acquisition method was improved to reflect the influence of node value differences on embedding. Then, the structural features and relational features of nodes were fused to enhance the similarity between nodes with similar neighbor nodes. In the clustering stage, a parameter called alternative nodes number was designed to alleviate the sensitivity of cut-off distance. The parameter analysis and performance evaluation were carried out on the open data set and real business data, and the results showed that the proposed method could obtain accurate and effective clustering results of fault entity nodes. © 2023 CIMS. All rights reserved.
引用
收藏
页码:4257 / 4266
页数:9
相关论文
共 34 条
  • [1] SUN Linfu, GAO Xueqin, Technology services and value chain collaborative business technology resources[[M], (2021)
  • [2] REN Chunhua, SUN Linfu, Dynamic evaluation of three-stage subjective and objective fair weights for collaborative enterprise groups, Computer Integrated Manufacturing Systems, 25, 11, pp. 2874-2891, (2019)
  • [3] YU Yang, SUN Linfu, REN Chunhua, Et al., Bilateral matching model of business resources for multi-service value chain, Computer Integrated Manufacturing Systems, 27, 5, pp. 1397-1409, (2021)
  • [4] LIU Pengcheng, SUN Linfu, Knowledge graph construction for multi-service value chains based on third-party cloud plat-form, Computer Integrated Manufacturing Systems, 28, 2, pp. 612-627, (2022)
  • [5] ZHAO Xianglong, Construction of vehicle fault knowledge graph based on third-party cloud platform [D], (2019)
  • [6] YAN X, GUO J, LAN Y, Et al., A blterm topic model for short texts, Proceedings of the 22nd International Conference on World Wide Web, pp. 1445-1456, (2013)
  • [7] YIN J, WANG J., A Dirichlet multinomial mixture model-based approach for short text clustering, Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 233-242, (2014)
  • [8] XU J, XU B, WANG P, Et al., Self-Taught convolutional neural networks for short text clustering, Neural Networks, 88, pp. 22-31, (2017)
  • [9] WANG F, XU B, XU J, Et al., Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification, Neurocomputing, 174, pp. 806-814, (2016)
  • [10] CHEN J, GONG Z, WANG W, Et al., Inductive document representation learning for short text clustering, Pro-ceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, (2021)