Contrastive Learning for Neural Topic Model

被引:0
|
作者
Thong Nguyen [1 ]
Luu Anh Tuan [2 ]
机构
[1] VinAI Res, Hanoi, Vietnam
[2] Nanyang Technol Univ, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent empirical studies show that adversarial topic models (ATM) can successfully capture semantic patterns of the document by differentiating a document with another dissimilar sample. However, utilizing that discriminative-generative architecture has two important drawbacks: (1) the architecture does not relate similar documents, which has the same document-word distribution of salient words; (2) it restricts the ability to integrate external information, such as sentiments of the document, which has been shown to benefit the training of neural topic model. To address those issues, we revisit the adversarial topic architecture in the viewpoint of mathematical analysis, propose a novel approach to re-formulate discriminative goal as an optimization problem, and design a novel sampling method which facilitates the integration of external variables. The reformulation encourages the model to incorporate the relations among similar samples and enforces the constraint on the similarity among dissimilar ones; while the sampling method, which is based on the internal input and reconstructed output, helps inform the model of salient words contributing to the main topic. Experimental results show that our framework outperforms other state-of-the-art neural topic models in three common benchmark datasets that belong to various domains, vocabulary sizes, and document lengths in terms of topic coherence.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Partition Semantics and Pragmatics of Contrastive Topic
    Yabushita, Katsuhiko
    CONTRASTIVENESS IN INFORMATION STRUCTURE, ALTERNATIVES AND SCALAR IMPLICATURES, 2017, 91 : 23 - 45
  • [42] The contrastive topic requirement on specificational subjects
    Milway, Daniel
    CANADIAN JOURNAL OF LINGUISTICS-REVUE CANADIENNE DE LINGUISTIQUE, 2020, 65 (02): : 181 - 215
  • [43] Topic shifters in Romanian: A contrastive analysis
    Ionescu, Alice
    JOURNAL OF PRAGMATICS, 2020, 156 : 110 - 120
  • [44] Dependency-Aware Neural Topic Model
    Huang, Heyan
    Tang, Yi-Kun
    Shi, Xuewen
    Mao, Xian-Ling
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [45] ConCas: Cascade Popularity Prediction Based on Topic-Aware Graph Contrastive Learning
    Ling, Chen
    Zhang, Xianren
    Shang, Jiaxing
    Liu, Dajiang
    Li, Yong
    Xie, Wu
    Qiang, Baohua
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 516 - 528
  • [46] Graph neural topic model with commonsense knowledge
    Zhu, Bingshan
    Cai, Yi
    Ren, Haopeng
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
  • [47] Topic-Awared Contrastive Learning for Incoming Fake News Detection in News Streams
    Zhang, Yongcheng
    Xiang, Changpeng
    Ren, Kai
    Wei, Xiaomei
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT V, NLPCC 2024, 2025, 15363 : 43 - 54
  • [48] Tree-Structured Neural Topic Model
    Isonuma, Masaru
    Mori, Junichiro
    Bollegala, Danushka
    Sakata, Ichiro
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 800 - 806
  • [49] Neural Variational Gaussian Mixture Topic Model
    Tang, Kun
    Huang, Heyan
    Shi, Xuewen
    Mao, Xian-Ling
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (04)
  • [50] ATM: Adversarial-neural Topic Model
    Wang, Rui
    Zhou, Deyu
    He, Yulan
    INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (06)