A novel topic clustering algorithm based on graph neural network for question topic diversity

被引:7
|
作者
Wu, Yongliang [1 ]
Wang, Xuejun [1 ]
Zhao, Wenbin [1 ]
Lv, Xiaofeng [2 ]
机构
[1] Shijiazhuang Tiedao Univ, Sch Informat Sci & Technol, Hebei 050043, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Hebei 050024, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph Neural Network; Topic clustering; Graph representation; MODEL; KNOWLEDGE; SENTIMENT;
D O I
10.1016/j.ins.2023.02.018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In community question answering, many questions have no topic labeling or the topic labeling is very diverse, which has become the biggest obstacle to building the bridge between users and posts. Topic clustering methods could alleviate this issue. However, existing research employed words as topic representation units and could not express topic semantic relevance. In this paper, we propose a novel Topic Clustering framework based on the Graph Neural Network (called TCGNN) to alleviate topic diversity in Community Question Answering. Firstly, we separately consider the relationship representation of existing topics and unlabeled topics. For manually labeled topics, we count the frequency of topics in community questions and construct a topic cooccurrence matrix to represent the topic relation. For unmarked topics, we extract the core phrases from community questions and employ them to indicate the topics of questions. Then, we transform the topic co-occurrence matrix into a topic relation graph, optimizing the topic relevance and improving presentation efficiency. Next, we employ a graph neural network for embedding the topic connection graph and get the vector representation of each topic. Finally, an improved K-mean method is proposed for topic clustering based on the distance of topic vectors. Additionally, we briefly discuss the extended effect of topic clustering methods in other domains (bibliographic information and reviews). In the literature we have, it is a primary work that conders topic clustering in multiple situations and offers innovative cogitation to apply graph neural networks in topic clustering. Our experiment compared prevalent clustering methods and some combination methods of text representation and graph embedding. The outcome of experiments on four extensive and varied datasets (Stack Overflow, DBLP, Yelp, and Zhihu) illustrate that TCGNN leads the prevalent baseline in Entropy and Purity.
引用
收藏
页码:685 / 702
页数:18
相关论文
共 50 条
  • [21] Hyperbolic Graph Topic Modeling Network with Continuously Updated Topic Tree
    Zhang, Delvin Ce
    Ying, Rex
    Lauw, Hady W.
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3206 - 3216
  • [22] Topic Oriented Semantic Parsing A topic based question representation
    Sharma, Lokesh Kumar
    Mittal, Namita
    2015 IEEE 9TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2015, : 159 - 164
  • [23] Topic-aware Heterogeneous Graph Neural Network for Link Prediction
    Xu, Siyong
    Yang, Cheng
    Shi, Chuan
    Fang, Yuan
    Guo, Yuxin
    Yang, Tianchi
    Zhang, Luhao
    Hu, Maodi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2261 - 2270
  • [24] Graph Attention Topic Modeling Network
    Yang, Liang
    Wu, Fan
    Gu, Junhua
    Wang, Chuan
    Cao, Xiaochun
    Jin, Di
    Guo, Yuanfang
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 144 - 154
  • [25] Graph Clustering based Topic Modeling using Feature Learning Approach
    Ganguli, Isha
    Sil, Jaya
    PROCEEDINGS OF THE WORKSHOP PROGRAM OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING (ICDCN'18), 2018,
  • [26] Graph-based topic models for trajectory clustering in crowd videos
    Al Ghamdi, Manal
    Gotoh, Yoshihiko
    MACHINE VISION AND APPLICATIONS, 2020, 31 (05)
  • [27] Graph-based topic models for trajectory clustering in crowd videos
    Manal Al Ghamdi
    Yoshihiko Gotoh
    Machine Vision and Applications, 2020, 31
  • [28] A Graph-Based Approach to Topic Clustering of Tourist Attraction Reviews
    Sirilertworakul, Nuttha
    Yimwadsana, Boonsit
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2019, 2019, 1078 : 343 - 354
  • [29] Graph Clustering Based Size Varying Rules for Lifelong Topic Modeling
    Khan, M. Taimoor
    Khalid, Shehzad
    Aziz, Furqan
    ICBRA 2018: PROCEEDINGS OF 2018 5TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS RESEARCH AND APPLICATIONS, 2018, : 73 - 77
  • [30] A Topic-based Dynamic Clustering Algorithm for Text Stream
    Rao, Y.
    Li, X. J.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRIAL ENGINEERING (AIIE 2015), 2015, 123 : 480 - 483