Unsupervised Latent Dirichlet Allocation for supervised question classification

被引:38
|
作者
Momtazi, Saeedeh [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn & Informat Technol, Tehran, Iran
关键词
Community-based QA; Question classification; LDA; MODELS;
D O I
10.1016/j.ipm.2018.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question answering systems assist users in satisfying their information needs more precisely by providing focused responses to their questions. Among the various systems developed for such a purpose, community-based question answering has recently received researchers' attention due to the large amount of user-generated questions and answers in social question-and-answer platforms. Reusing such data sources requires an accurate information retrieval component enhanced by a question classifier. The question classification gives the system the possibility to have information about question categories to focus on questions and answers from relevant categories to the input question. In this paper, we propose a new method based on unsupervised Latent Dirichlet Allocation for classifying questions in community-based question answering. Our method first uses unsupervised topic modeling to extract topics from a large amount of unlabeled data. The learned topics are then used in the training phase to find their association with the available category labels in the training data. The category mixture of topics is finally used to predict the label of unseen data.
引用
收藏
页码:380 / 393
页数:14
相关论文
共 50 条
  • [31] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 601 - 608
  • [32] Classification of New Titles by Two Stage Latent Dirichlet Allocation
    Guven, Zekeriya Anil
    Diri, Banu
    Cakaloglu, Tolgahan
    2018 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2018, : 99 - 103
  • [33] Latent Dirichlet Allocation for Classification using Gene Expression Data
    Yalamanchili, Hima Bindu
    Kho, Soon Jye
    Raymer, Michael L.
    2017 IEEE 17TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2017, : 39 - 44
  • [34] A New Latent generalized Dirichlet Allocation Model for Image Classification
    Ihou, Koffi Eddy
    Bouguila, Nizar
    PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [35] Classification of Indonesian News Articles based on Latent Dirichlet Allocation
    Kusumaningrum, Retno
    Adhy, Satriyo
    Wiedjayanto, M. Ihsan Aji
    Suryono
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2016,
  • [36] Feature Substitution Using Latent Dirichlet Allocation for Text Classification
    Mathivanan, Norsyela Muhammad Noor
    Janor, Roziah Mohd
    Abd Razak, Shukor
    Ghani, Nor Azura Md.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 1087 - 1098
  • [37] Unsupervised segmentation of greenhouse plant images based on modified Latent Dirichlet Allocation
    Wang, Yi
    Xu, Lihong
    PEERJ, 2018, 6
  • [38] UNSUPERVISED LANGUAGE MODEL ADAPTATION USING LATENT DIRICHLET ALLOCATION AND DYNAMIC MARGINALS
    Haidar, Md. Akmal
    O'Shaughnessy, Douglas
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1480 - 1484
  • [39] Comparing Hierarchical Dirichlet Process with Latent Dirichlet Allocation in Bug Report Multiclass Classification
    Limsettho, Nachai
    Hata, Hideaki
    Matsumoto, Ken-ichi
    2014 15TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2014, : 137 - 142
  • [40] Regularized Semi-supervised Latent Dirichlet Allocation for Visual Concept Learning
    Zhuang, Liansheng
    She, Lanbo
    Huang, Jingjing
    Luo, Jiebo
    Yu, Nenghai
    ADVANCES IN MULTIMEDIA MODELING, PT I, 2011, 6523 : 403 - +