TLATR: Automatic Topic Labeling Using Automatic (Domain-Specific) Term Recognition

被引:5
|
作者
Truica, Ciprian-Octavian [1 ]
Apostol, Elena-Simona [1 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat Control & Comp, Dept Comp Sci & Engn, Bucharest 060042, Romania
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Labeling; Task analysis; Mutual information; Semantics; Indexes; Computational modeling; Bit error rate; Automatic term recognition; automatic topic labeling evaluation; topic labeling; topic modeling; INFORMATION-RETRIEVAL; PROBABILISTIC MODEL;
D O I
10.1109/ACCESS.2021.3083000
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topic modeling is a probabilistic graphical model for discovering latent topics in text corpora by using multinomial distributions of topics over words. Topic labeling is used to assign meaningful labels for the discovered topics. In this paper, we present a new topic labeling method that uses automatic term recognition to discover and assign relevant labels for each topic, i.e., TLATR (Topic Labeling using Automatic Term Recognition). TLATR uses domain-specific multi-terms that appear in the set of documents belonging to a topic. The multi-term having the highest score as determined by the automatic term recognition algorithm is chosen as the label for that topic. To evaluate TLATR, we use two real, publicly available datasets that contain scientific articles' abstracts. The topic label evaluation is done both automatically and using human annotators. For the automatic evaluation, we use Pointwise Mutual Information, Normalized Pointwise Mutual Information, and document similarity. For human evaluation, we employ the average rating method. Furthermore, we also evaluate the quality of the topic models using the Adjusted Rand Index. To prove that our novel method extracts relevant topic labels, we compare TLATR with two state-of-the-art methods, one supervised and one unsupervised, provided by the NETL Automatic Topic Labelling system. The experimental results show that our method outperforms or provides similar results with both NETL's supervised and unsupervised approaches.
引用
收藏
页码:76624 / 76641
页数:18
相关论文
共 50 条
  • [41] Automatic Domain-specific Corpora Generation from Wikipedia - A Replication Study
    Ruwanpura, Seniru
    Morash, Cale
    Khan, Momin Ali
    Ahmad, Adnan
    Ginde, Gouri
    2023 IEEE 31ST INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS, REW, 2023, : 85 - 94
  • [42] Automatic Domain-Specific SoC Design for Autonomous Unmanned Aerial Vehicles
    Krishnan, Srivatsan
    Wan, Zishen
    Bhardwaj, Kshitij
    Whatmoug, Paul
    Faust, Aleksandra
    Neuman, Sabrina
    Wei, Gu-Yeon
    Brooks, David
    Reddi, Vijay Janapa
    2022 55TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2022, : 300 - 317
  • [43] Automatic Expansion of Domain-Specific Affective Models for Web Intelligence Applications
    Albert Weichselbraun
    Jakob Steixner
    Adrian M.P. Braşoveanu
    Arno Scharl
    Max Göbel
    Lyndon J. B. Nixon
    Cognitive Computation, 2022, 14 : 228 - 245
  • [44] Automatic Expansion of Domain-Specific Affective Models for Web Intelligence Applications
    Weichselbraun, Albert
    Steixner, Jakob
    Brasoveanu, Adrian M. P.
    Scharl, Arno
    Goebel, Max
    Nixon, Lyndon J. B.
    COGNITIVE COMPUTATION, 2022, 14 (01) : 228 - 245
  • [45] An Automatic Approach for Domain-specific Dictionary Expansion Based on Web Mining
    Sun, Yueheng
    Ni, Weijie
    Men, Rui
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 2, 2009, : 96 - 99
  • [46] Automatic Labeling of Topic Models Using Graph-Based Ranking
    He, Dongbin
    Wang, Minjuan
    Khattak, Abdul Mateen
    Zhang, Li
    Gao, Wanlin
    IEEE ACCESS, 2019, 7 : 131593 - 131608
  • [47] Automatic noise reduction of domain-specific bibliographic datasets using positive-unlabeled learning
    Guo Chen
    Jing Chen
    Yu Shao
    Lu Xiao
    Scientometrics, 2023, 128 : 1187 - 1204
  • [48] Regulation of Automatic Imitation: Domain-Specific Versus Domain-General Control Processes
    Bouquet, Cedric A.
    van der Wel, Robrecht P. R. D.
    Lafleur, Melissa
    Duffy, Sean
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2025, 51 (01) : 32 - 49
  • [49] Automatic construction of domain-specific sentiment lexicon for unsupervised domain adaptation and sentiment classification
    Beigi, Omid Mohamad
    Moattar, Mohammad H.
    KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [50] Automatic noise reduction of domain-specific bibliographic datasets using positive-unlabeled learning
    Chen, Guo
    Chen, Jing
    Shao, Yu
    Xiao, Lu
    SCIENTOMETRICS, 2023, 128 (02) : 1187 - 1204