Unsupervised Latent Dirichlet Allocation for supervised question classification

被引:38
|
作者
Momtazi, Saeedeh [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn & Informat Technol, Tehran, Iran
关键词
Community-based QA; Question classification; LDA; MODELS;
D O I
10.1016/j.ipm.2018.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question answering systems assist users in satisfying their information needs more precisely by providing focused responses to their questions. Among the various systems developed for such a purpose, community-based question answering has recently received researchers' attention due to the large amount of user-generated questions and answers in social question-and-answer platforms. Reusing such data sources requires an accurate information retrieval component enhanced by a question classifier. The question classification gives the system the possibility to have information about question categories to focus on questions and answers from relevant categories to the input question. In this paper, we propose a new method based on unsupervised Latent Dirichlet Allocation for classifying questions in community-based question answering. Our method first uses unsupervised topic modeling to extract topics from a large amount of unlabeled data. The learned topics are then used in the training phase to find their association with the available category labels in the training data. The category mixture of topics is finally used to predict the label of unseen data.
引用
收藏
页码:380 / 393
页数:14
相关论文
共 50 条
  • [41] Semi-supervised Document Clustering Based on Latent Dirichlet Allocation (LDA)
    秦永彬
    李解
    黄瑞章
    李晶
    JournalofDonghuaUniversity(EnglishEdition), 2016, 33 (05) : 685 - 688
  • [42] Regularized Semi-Supervised Latent Dirichlet Allocation for visual concept learning
    Zhuang, Liansheng
    Gao, Haoyuan
    Luo, Jiebo
    Lin, Zhouchen
    NEUROCOMPUTING, 2013, 119 : 26 - 32
  • [43] Sequential latent Dirichlet allocation
    Du, Lan
    Buntine, Wray
    Jin, Huidong
    Chen, Changyou
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (03) : 475 - 503
  • [44] Collective Latent Dirichlet Allocation
    Shen, Zhi-Yong
    Sun, Jun
    Shen, Yi-Dong
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 1019 - 1024
  • [45] Initializing Deep Learning Based on Latent Dirichlet Allocation for Document Classification
    Jeon, Hyung-Bae
    Lee, Soo-Young
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 : 634 - 641
  • [46] Rail transit fault text classification based on the latent dirichlet allocation
    Li, R.
    Su, S.
    Wang, G.
    Qu, J.
    Cao, Y.
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1359 - 1364
  • [47] The Security of Latent Dirichlet Allocation
    Mei, Shike
    Zhu, Xiaojin
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 681 - 689
  • [48] A text classification model constructed by Latent Dirichlet Allocation and Deep Learning
    Liu, Yu
    Jin, Zhengping
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 2501 - 2504
  • [49] Max-Margin Latent Dirichlet Allocation for Image Classification and Annotation
    Wang, Yang
    Mori, Greg
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [50] Novel Weighting Scheme for Unsupervised Language Model Adaptation Using Latent Dirichlet Allocation
    Haidar, Md Akmal
    O'Shaughnessy, Douglas
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2438 - 2441