Towards unsupervised keyphrase extraction via an autoregressive approach

被引:1
|
作者
Li, Tuohang [1 ]
Hu, Liang [1 ]
Li, Hongtu [1 ]
Sun, Chengyu [1 ]
Li, Shuai [1 ]
Chi, Ling [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Keyphrase extraction; Autoregressive structure; Optimizer; Unsupervised model; Coverage decay optimizer;
D O I
10.1016/j.knosys.2023.110664
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyphrase extraction is a technique used to capture the core information of documents and is an upstream task for advanced information retrieval systems, particularly in the academic realm. Current unsupervised methods are primarily built on a score-and-rank framework with a consistent inability to acquire mutual information between extracted keyphrases, especially with graph-based models. Utilizing the autoregressive structure that is typically used in sequence-to-sequence text generation models, we propose a plug-and-play optimizer named C-Decay that can be integrated into any graph -based unsupervised keyphrase extraction model for a stable performance boost, and that mitigates the bias of certain semantically or lexically dominant tokens by optimizing the origin score distribution output by graph-based models directly. The architecture of C-Decay includes the keyphrase pool, the gain vector and the decay factor, where the keyphrase pool is designed to realize an autoregressive structure and the gain vector and the decay factor are the optimization operator. Herein, we examine three graph-based models integrated with C-Decay, and the experiment is conducted on four datasets KDD, Semeval, Nguyen, and Krapivin. Moreover, we prove that C-Decay can improve accuracy and F-Measure by an average of approximately 50% and 20%, respectively.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Generative non-autoregressive unsupervised keyphrase extraction with neural topic modeling
    Zhu, Xun
    Lou, Yinxia
    Zhao, Jing
    Gao, Wang
    Deng, Hongtao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [2] Unsupervised Keyphrase Extraction via Interpretable Neural Networks
    Joshi, Rishabh
    Balachandran, Vidhisha
    Saldanha, Emily
    Glenski, Maria
    Volkova, Svitlana
    Tsvetkov, Yulia
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1107 - 1119
  • [3] HAKE: an Unsupervised Approach to Automatic Keyphrase Extraction for Multiple Domains
    Merrouni, Zakariae Alami
    Frikh, Bouchra
    Ouhbi, Brahim
    COGNITIVE COMPUTATION, 2022, 14 (02) : 852 - 874
  • [4] PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents
    Florescu, Corina
    Caragea, Cornelia
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1105 - 1115
  • [5] A Fuzzy Approach to Improve an Unsupervised Automatic Keyphrase Extraction Process
    Perez-Guadarrama, Yamel
    Simon-Cuevas, Alfredo
    Hojas-Mazo, Wenny
    Olivas, Jose A.
    Romero, Francisco P.
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [6] HAKE: an Unsupervised Approach to Automatic Keyphrase Extraction for Multiple Domains
    Zakariae Alami Merrouni
    Bouchra Frikh
    Brahim Ouhbi
    Cognitive Computation, 2022, 14 : 852 - 874
  • [7] AdaptiveUKE: Towards adaptive unsupervised keyphrase extraction with gated topic modeling
    Liu, Qi
    Ke, Wenjun
    Yuan, Xiaoguang
    Yang, Yuting
    Zhao, Hua
    Wang, Peng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [8] Keyphrase Distance Analysis Technique from News Articles as a Feature for Keyphrase Extraction: An Unsupervised Approach
    Miah, Mohammad Badrul Alam
    Awang, Suryanti
    Rahman, Md Mustafizur
    Hosen, A. S. M. Sanwar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 995 - 1002
  • [9] TripleRank: An unsupervised keyphrase extraction algorithm
    Li, Tuohang
    Hu, Liang
    Li, Hongtu
    Sun, Chengyu
    Li, Shuai
    Chi, Ling
    KNOWLEDGE-BASED SYSTEMS, 2021, 219 (219)
  • [10] Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function
    Song, Mingyang
    Jiang, Haiyun
    Liu, Lemao
    Shi, Shuming
    Jing, Liping
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 2482 - 2494