A novel topic clustering algorithm based on graph neural network for question topic diversity

被引:7
|
作者
Wu, Yongliang [1 ]
Wang, Xuejun [1 ]
Zhao, Wenbin [1 ]
Lv, Xiaofeng [2 ]
机构
[1] Shijiazhuang Tiedao Univ, Sch Informat Sci & Technol, Hebei 050043, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Hebei 050024, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph Neural Network; Topic clustering; Graph representation; MODEL; KNOWLEDGE; SENTIMENT;
D O I
10.1016/j.ins.2023.02.018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In community question answering, many questions have no topic labeling or the topic labeling is very diverse, which has become the biggest obstacle to building the bridge between users and posts. Topic clustering methods could alleviate this issue. However, existing research employed words as topic representation units and could not express topic semantic relevance. In this paper, we propose a novel Topic Clustering framework based on the Graph Neural Network (called TCGNN) to alleviate topic diversity in Community Question Answering. Firstly, we separately consider the relationship representation of existing topics and unlabeled topics. For manually labeled topics, we count the frequency of topics in community questions and construct a topic cooccurrence matrix to represent the topic relation. For unmarked topics, we extract the core phrases from community questions and employ them to indicate the topics of questions. Then, we transform the topic co-occurrence matrix into a topic relation graph, optimizing the topic relevance and improving presentation efficiency. Next, we employ a graph neural network for embedding the topic connection graph and get the vector representation of each topic. Finally, an improved K-mean method is proposed for topic clustering based on the distance of topic vectors. Additionally, we briefly discuss the extended effect of topic clustering methods in other domains (bibliographic information and reviews). In the literature we have, it is a primary work that conders topic clustering in multiple situations and offers innovative cogitation to apply graph neural networks in topic clustering. Our experiment compared prevalent clustering methods and some combination methods of text representation and graph embedding. The outcome of experiments on four extensive and varied datasets (Stack Overflow, DBLP, Yelp, and Zhihu) illustrate that TCGNN leads the prevalent baseline in Entropy and Purity.
引用
收藏
页码:685 / 702
页数:18
相关论文
共 50 条
  • [11] A Novel Hybrid Clustering Algorithm for Microblog Topic Detection
    Geng, Xiao
    Zhang, Yanmei
    Jiao, Yuhang
    Mei, Yinan
    2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, RESOURCE AND ENVIRONMENTAL ENGINEERING (MSREE 2017), 2017, 1890
  • [12] Topic Modeling Revisited: A Document Graph-based Neural Network Perspective
    Shen, Dazhong
    Qin, Chuan
    Wang, Chao
    Dong, Zheng
    Zhu, Hengshu
    Xiong, Hui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [13] A Multifocal Graph-Based Neural Network Scheme for Topic Event Extraction
    Wan, Qizhi
    Wan, Changxuan
    Xiao, Keli
    Hu, Rong
    Liu, Dexi
    Liao, Guoqiong
    Liu, Xiping
    Shuai, Yuxin
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2025, 43 (01)
  • [14] Topic Clustering for Social Media Texts with Heterogeneous Graph Neural Networks
    Xiaodong F.
    Kangxin H.
    Data Analysis and Knowledge Discovery, 2022, 6 (10) : 9 - 19
  • [15] User clustering topic recommendation algorithm based on two phase in the social network
    Pei, Li
    Xiaoying, Pan
    Hao, Chen
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (10): : 233 - 246
  • [16] A Semantic Graph based Topic Model for Question Retrieval in Community Question Answering
    Chen, Long
    Jose, Joemon M.
    Yu, Haitao
    Yuan, Fajie
    Zhang, Dell
    PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, : 287 - 296
  • [17] Topic text detection by clustering algorithm for social network media
    Sha S.
    International Journal of Networking and Virtual Organisations, 2024, 30 (03) : 246 - 256
  • [18] A novel text clustering model based on topic modelling and social network analysis
    Amiri, Babak
    Karimianghadim, Ramin
    CHAOS SOLITONS & FRACTALS, 2024, 181
  • [19] A Novel Hybrid Clustering Algorithm for Topic Detection on Chinese Microblogging
    Geng, Xiao
    Zhang, Yanmei
    Jiao, Yuhang
    Mei, Yinan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (02): : 289 - 300
  • [20] Topic-Selective Graph Network for Topic-Focused Summarization
    Shi, Zesheng
    Zhou, Yucheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT IV, 2023, 13938 : 247 - 259