A novel topic clustering algorithm based on graph neural network for question topic diversity

被引:7
|
作者
Wu, Yongliang [1 ]
Wang, Xuejun [1 ]
Zhao, Wenbin [1 ]
Lv, Xiaofeng [2 ]
机构
[1] Shijiazhuang Tiedao Univ, Sch Informat Sci & Technol, Hebei 050043, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Hebei 050024, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph Neural Network; Topic clustering; Graph representation; MODEL; KNOWLEDGE; SENTIMENT;
D O I
10.1016/j.ins.2023.02.018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In community question answering, many questions have no topic labeling or the topic labeling is very diverse, which has become the biggest obstacle to building the bridge between users and posts. Topic clustering methods could alleviate this issue. However, existing research employed words as topic representation units and could not express topic semantic relevance. In this paper, we propose a novel Topic Clustering framework based on the Graph Neural Network (called TCGNN) to alleviate topic diversity in Community Question Answering. Firstly, we separately consider the relationship representation of existing topics and unlabeled topics. For manually labeled topics, we count the frequency of topics in community questions and construct a topic cooccurrence matrix to represent the topic relation. For unmarked topics, we extract the core phrases from community questions and employ them to indicate the topics of questions. Then, we transform the topic co-occurrence matrix into a topic relation graph, optimizing the topic relevance and improving presentation efficiency. Next, we employ a graph neural network for embedding the topic connection graph and get the vector representation of each topic. Finally, an improved K-mean method is proposed for topic clustering based on the distance of topic vectors. Additionally, we briefly discuss the extended effect of topic clustering methods in other domains (bibliographic information and reviews). In the literature we have, it is a primary work that conders topic clustering in multiple situations and offers innovative cogitation to apply graph neural networks in topic clustering. Our experiment compared prevalent clustering methods and some combination methods of text representation and graph embedding. The outcome of experiments on four extensive and varied datasets (Stack Overflow, DBLP, Yelp, and Zhihu) illustrate that TCGNN leads the prevalent baseline in Entropy and Purity.
引用
收藏
页码:685 / 702
页数:18
相关论文
共 50 条
  • [1] A Novel Graph Based Clustering Approach to Document Topic Modeling
    Chanda, Prateek
    Das, Asit Kumar
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [2] Graph Topic Neural Network for Document Representation
    Xie, Qianqian
    Huang, Jimin
    Du, Pan
    Peng, Min
    Nie, Jian-Yun
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3055 - 3065
  • [3] Graph Structural-topic Neural Network
    Long, Qingqing
    Jin, Yilun
    Song, Guojie
    Li, Yi
    Lin, Wei
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1065 - 1073
  • [4] Topic Mining Based on Graph Local Clustering
    Garza Villarreal, Sara Elena
    Brena, Ramon F.
    ADVANCES IN SOFT COMPUTING, PT II, 2011, 7095 : 201 - +
  • [5] Analyzing temporal patterns of topic diversity using graph clustering
    Takako Hashimoto
    David Lawrence Shepard
    Tetsuji Kuboyama
    Kilho Shin
    Ryota Kobayashi
    Takeaki Uno
    The Journal of Supercomputing, 2021, 77 : 4375 - 4388
  • [6] Analyzing temporal patterns of topic diversity using graph clustering
    Hashimoto, Takako
    Shepard, David Lawrence
    Kuboyama, Tetsuji
    Shin, Kilho
    Kobayashi, Ryota
    Uno, Takeaki
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (05): : 4375 - 4388
  • [7] External information enhancing topic model based on graph neural network
    Song, Jie
    Lu, Xiaoling
    Hong, Jingya
    Wang, Feifei
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [8] A Network Decomposition-based Text Clustering Algorithm for Topic Detection
    Meng, Zuqiang
    Shen, Shimo
    Chen, Qiulian
    MEASUREMENT TECHNOLOGY AND ITS APPLICATION, PTS 1 AND 2, 2013, 239-240 : 1318 - 1323
  • [9] Cycling topic graph learning for neural topic modeling
    Liu, Yanyan
    Gong, Zhiguo
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [10] A Novel Approach of Neural Topic Modelling for Document Clustering
    Subramani, Sandhya
    Sridhar, Vaishnavi
    Shetty, Kaushal
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 2169 - 2173