Unsupervised Latent Dirichlet Allocation for supervised question classification

被引:38
|
作者
Momtazi, Saeedeh [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn & Informat Technol, Tehran, Iran
关键词
Community-based QA; Question classification; LDA; MODELS;
D O I
10.1016/j.ipm.2018.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question answering systems assist users in satisfying their information needs more precisely by providing focused responses to their questions. Among the various systems developed for such a purpose, community-based question answering has recently received researchers' attention due to the large amount of user-generated questions and answers in social question-and-answer platforms. Reusing such data sources requires an accurate information retrieval component enhanced by a question classifier. The question classification gives the system the possibility to have information about question categories to focus on questions and answers from relevant categories to the input question. In this paper, we propose a new method based on unsupervised Latent Dirichlet Allocation for classifying questions in community-based question answering. Our method first uses unsupervised topic modeling to extract topics from a large amount of unlabeled data. The learned topics are then used in the training phase to find their association with the available category labels in the training data. The category mixture of topics is finally used to predict the label of unseen data.
引用
收藏
页码:380 / 393
页数:14
相关论文
共 50 条
  • [21] Multilayer classification of web pages using Random Forest and semi-supervised Latent Dirichlet Allocation
    Sayadi, Karim
    Bui, Quang Vu
    Bui, Marc
    2015 15TH INTERNATIONAL CONFERENCE ON INNOVATIONS FOR COMMUNITY SERVICES (I4CS), 2015,
  • [22] Scene classification using class-supervised local-space-constraint latent Dirichlet allocation
    Huang, Chao
    Luo, Wang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (17) : 10227 - 10240
  • [23] Inference Algorithms in Latent Dirichlet Allocation for Semantic Classification
    Zubir, Wan Mohammad Aflah Mohammad
    Aziz, Izzatdin Abdul
    Jaafar, Jafreezal
    Hasan, Mohd Hilmi
    APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 173 - 184
  • [24] THE VARIANT OF LATENT DIRICHLET ALLOCATION FOR NATURAL SCENE CLASSIFICATION
    Tang Yingjun
    COMPUTING AND INFORMATICS, 2011, 30 (02) : 311 - 319
  • [25] A Hybrid Latent Dirichlet Allocation Approach for Topic Classification
    Hsu, Chi-I
    Chiu, Chaochang
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 312 - 315
  • [26] Semi supervised classification of scientific and technical literature based on semi supervised hierarchical description of improved latent dirichlet allocation (LDA)
    Yongjun Zhang
    Jialin Ma
    Zijian Wang
    Cluster Computing, 2019, 22 : 6881 - 6889
  • [27] Semi supervised classification of scientific and technical literature based on semi supervised hierarchical description of improved latent dirichlet allocation (LDA)
    Zhang, Yongjun
    Ma, Jialin
    Wang, Zijian
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S6881 - S6889
  • [28] Latent Dirichlet Allocation for Unsupervised Activity Analysis on an Autonomous Mobile Robot
    Duckworth, Paul
    Alomari, Muhannad
    Charles, James
    Hogg, David C.
    Cohn, Anthony G.
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3819 - 3826
  • [29] A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
    van der Merwe, Werner
    Kamper, Herman
    du Preez, Johan
    INTERSPEECH 2022, 2022, : 1426 - 1430
  • [30] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022