Graph Topic Neural Network for Document Representation

被引:21
|
作者
Xie, Qianqian [1 ]
Huang, Jimin [1 ]
Du, Pan [2 ]
Peng, Min [1 ]
Nie, Jian-Yun [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Univ Montreal, Dept Comp Sci & Operat Res, Montreal, PQ, Canada
基金
国家重点研发计划; 美国国家科学基金会;
关键词
graph neural networks; topic models; document representation;
D O I
10.1145/3442381.3450045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Neural Networks (GNNs) such as GCN can effectively learn document representations via the semantic relation graph among documents and words. However, despite a few exceptions, most of the previous work in this line of research does not consider the underlying topical semantics inherited in document contents and the relation graph, making the representations less effective and hard to interpret. In a few recent studies trying to incorporate latent topics into GNNs, the topics have been learned independently from the relation graph modeling. Intuitively, topic extraction can benefit much from the information propagation of the relation graph structure - directly and indirectly connected documents and words have similar topics. In this paper, we propose a novel Graph Topic Neural Network (GTNN) model to mine latent topic semantics for interpretable document representation learning, taking into account the document-document, document-word, and word-word relationships in the graph. We also show that our model can be viewed as semi-amortized inference for relational topic model based on Poisson distribution, with high order correlations. We test our model in several settings: unsupervised, semi-supervised, and supervised representation learning, for both connected and unconnected documents. In all the cases, our model outperforms the state-of-the-art models for these tasks.
引用
收藏
页码:3055 / 3065
页数:11
相关论文
共 50 条
  • [1] Topic Modeling Revisited: A Document Graph-based Neural Network Perspective
    Shen, Dazhong
    Qin, Chuan
    Wang, Chao
    Dong, Zheng
    Zhu, Hengshu
    Xiong, Hui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [2] Neural Topic Modeling by Incorporating Document Relationship Graph
    Zhou, Deyu
    Hu, Xuemeng
    Wang, Rui
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3790 - 3796
  • [3] Quantum probability-inspired graph neural network for document representation and classification
    Yan, Peng
    Li, Linjing
    Jin, Miaotianzi
    Zeng, Daniel
    NEUROCOMPUTING, 2021, 445 : 276 - 286
  • [4] Graph Structural-topic Neural Network
    Long, Qingqing
    Jin, Yilun
    Song, Guojie
    Li, Yi
    Lin, Wei
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1065 - 1073
  • [5] Learning Dynamic Hierarchical Topic Graph with Graph Convolutional Network for Document Classification
    Wang, Zhengjue
    Wang, Chaojie
    Zhang, Hao
    Duan, Zhibin
    Zhou, Mingyuan
    Cheny, Bo
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [6] Topic Attentional Neural Network for Abstractive Document Summarization
    Liu, Hao
    Zheng, Hai-Tao
    Wang, Wei
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT II, 2019, 11440 : 70 - 81
  • [7] Document mining using graph neural network
    Yong, S. L.
    Hagenbuchner, M.
    Tsoi, A. C.
    Scarselli, F.
    Gori, M.
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 458 - 472
  • [8] Geodesic Graph Neural Network for Efficient Graph Representation Learning
    Kong, Lecheng
    Chen, Yixin
    Zhang, Muhan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [9] Graph Neural Network for Symbol Detection on Document Images
    Renton, Guillaume
    Heroux, Pierre
    Adam, Sebastien
    Gauzere, Benoit
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 13TH IAPR INTERNATIONAL WORKSHOP ON GRAPHICS RECOGNITION (GREC 2019), VOL 1, 2019, : 62 - 67
  • [10] A novel topic clustering algorithm based on graph neural network for question topic diversity
    Wu, Yongliang
    Wang, Xuejun
    Zhao, Wenbin
    Lv, Xiaofeng
    INFORMATION SCIENCES, 2023, 629 : 685 - 702