Unsupervised Latent Dirichlet Allocation for supervised question classification

被引:38
|
作者
Momtazi, Saeedeh [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn & Informat Technol, Tehran, Iran
关键词
Community-based QA; Question classification; LDA; MODELS;
D O I
10.1016/j.ipm.2018.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question answering systems assist users in satisfying their information needs more precisely by providing focused responses to their questions. Among the various systems developed for such a purpose, community-based question answering has recently received researchers' attention due to the large amount of user-generated questions and answers in social question-and-answer platforms. Reusing such data sources requires an accurate information retrieval component enhanced by a question classifier. The question classification gives the system the possibility to have information about question categories to focus on questions and answers from relevant categories to the input question. In this paper, we propose a new method based on unsupervised Latent Dirichlet Allocation for classifying questions in community-based question answering. Our method first uses unsupervised topic modeling to extract topics from a large amount of unlabeled data. The learned topics are then used in the training phase to find their association with the available category labels in the training data. The category mixture of topics is finally used to predict the label of unseen data.
引用
收藏
页码:380 / 393
页数:14
相关论文
共 50 条
  • [1] Unsupervised Latent Dirichlet Allocation for supervised question classification (vol 54, pg 380, 2018)
    Momtazi, Saeedeh
    Gurevych, Iryna
    INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (03) : 1080 - 1080
  • [2] INFERENCE IN SUPERVISED LATENT DIRICHLET ALLOCATION
    Lakshminarayanan, Balaji
    Raich, Raviv
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [3] Unsupervised Object Localization with Latent Dirichlet Allocation
    Yang, Tong-feng
    Ma, Jun
    2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2013), 2013, : 230 - 234
  • [4] Unsupervised Feature Selection for Latent Dirichlet Allocation
    Xu Weiran
    Du Gang
    Chen Guang
    Guo Jun
    Yang Jie
    CHINA COMMUNICATIONS, 2011, 8 (05) : 54 - 62
  • [5] Semi-Supervised Latent Dirichlet Allocation and its Application for Document Classification
    Wang, Di
    Thint, Marcus
    Al-Rubaie, Ahmad
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 306 - 310
  • [6] Social Event Classification via Boosted Multimodal Supervised Latent Dirichlet Allocation
    Qian, Shengsheng
    Zhang, Tianzhu
    Xu, Changsheng
    Hossain, M. Shamim
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 11 (02)
  • [7] Multimodal Semantics-Based Supervised Latent Dirichlet Allocation for Event Classification
    Miao, Naiyang
    Xue, Feng
    Hong, Richang
    IEEE MULTIMEDIA, 2021, 28 (04) : 8 - 17
  • [8] Unsupervised Language Filtering using the Latent Dirichlet Allocation
    Zhang, Wei
    Clark, Robert A. J.
    Wang, Yongyuan
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1268 - 1272
  • [9] Unsupervised language identification based on Latent Dirichlet Allocation
    Zhang, Wei
    Clark, Robert A. J.
    Wang, Yongyuan
    Li, Wen
    COMPUTER SPEECH AND LANGUAGE, 2016, 39 : 47 - 66
  • [10] Automated classification of software change messages by semi-supervised Latent Dirichlet Allocation
    Fu, Ying
    Yan, Meng
    Zhang, Xiaohong
    Xu, Ling
    Yang, Dan
    Kymer, Jeffrey D.
    INFORMATION AND SOFTWARE TECHNOLOGY, 2015, 57 : 369 - 377