A novel topic clustering algorithm based on graph neural network for question topic diversity

被引:7
|
作者
Wu, Yongliang [1 ]
Wang, Xuejun [1 ]
Zhao, Wenbin [1 ]
Lv, Xiaofeng [2 ]
机构
[1] Shijiazhuang Tiedao Univ, Sch Informat Sci & Technol, Hebei 050043, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Hebei 050024, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph Neural Network; Topic clustering; Graph representation; MODEL; KNOWLEDGE; SENTIMENT;
D O I
10.1016/j.ins.2023.02.018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In community question answering, many questions have no topic labeling or the topic labeling is very diverse, which has become the biggest obstacle to building the bridge between users and posts. Topic clustering methods could alleviate this issue. However, existing research employed words as topic representation units and could not express topic semantic relevance. In this paper, we propose a novel Topic Clustering framework based on the Graph Neural Network (called TCGNN) to alleviate topic diversity in Community Question Answering. Firstly, we separately consider the relationship representation of existing topics and unlabeled topics. For manually labeled topics, we count the frequency of topics in community questions and construct a topic cooccurrence matrix to represent the topic relation. For unmarked topics, we extract the core phrases from community questions and employ them to indicate the topics of questions. Then, we transform the topic co-occurrence matrix into a topic relation graph, optimizing the topic relevance and improving presentation efficiency. Next, we employ a graph neural network for embedding the topic connection graph and get the vector representation of each topic. Finally, an improved K-mean method is proposed for topic clustering based on the distance of topic vectors. Additionally, we briefly discuss the extended effect of topic clustering methods in other domains (bibliographic information and reviews). In the literature we have, it is a primary work that conders topic clustering in multiple situations and offers innovative cogitation to apply graph neural networks in topic clustering. Our experiment compared prevalent clustering methods and some combination methods of text representation and graph embedding. The outcome of experiments on four extensive and varied datasets (Stack Overflow, DBLP, Yelp, and Zhihu) illustrate that TCGNN leads the prevalent baseline in Entropy and Purity.
引用
收藏
页码:685 / 702
页数:18
相关论文
共 50 条
  • [41] Topic discovery method based on topic model combined with hierarchical clustering
    Wang, An
    Zhang, Junjie
    PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 814 - 818
  • [42] Graph-based Correlated Topic Model for Trajectory Clustering in Crowded Videos
    Al Ghamdi, Manal
    Gotoh, Yoshihiko
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1029 - 1037
  • [43] Topic distillation and clustering algorithm based on the topology of pages-keywords
    Deng, Jian-Shuang
    Zheng, Qi-Lun
    Peng, Hong
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1581 - +
  • [44] Question Answering Algorithm for Grid Fault Diagnosis based on Graph Neural Network
    Yu, Yahan
    Wang, Yun
    Zhang, Guigang
    Yang, Yi
    Wang, Jian
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 552 - 557
  • [45] Topic-Enhanced Multi-level Graph Neural Network for Session-Based Recommendation
    Tang G.
    Zhu X.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (02): : 174 - 186
  • [46] TOPIC MODELING BASED ON ATTRIBUTED GRAPH
    Zhang Lidan
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [47] Research on Topic Detection of Network Public Opinion Based on Hierarchical Clustering
    Liu, Lu
    Jiang, Zheng-tao
    INTERNATIONAL CONFERENCE ON SIMULATION, MODELLING AND MATHEMATICAL STATISTICS (SMMS 2015), 2015, : 291 - 295
  • [48] Analytics and visualization of citation network applying topic-based clustering
    Nakazawa, Rina
    Itoh, Takayuki
    Saito, Takafumi
    JOURNAL OF VISUALIZATION, 2018, 21 (04) : 681 - 693
  • [49] Analytics and visualization of citation network applying topic-based clustering
    Rina Nakazawa
    Takayuki Itoh
    Takafumi Saito
    Journal of Visualization, 2018, 21 : 681 - 693
  • [50] Image Annotation Based on Convolutional Neural Network and Topic Model
    Zhang Lei
    Cai Ming
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (20)