Language processing and learning models for community question answering in Arabic

被引:16
|
作者
Romeo, Salvatore [1 ]
Da San Martino, Giovanni [1 ]
Belinkov, Yonatan [2 ]
Barron-Cedeno, Alberto [1 ]
Eldesouki, Mohamed [1 ]
Darwish, Kareem [1 ]
Mubarak, Hamdy [1 ]
Glass, James [2 ]
Moschitti, Alessandro [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Community question answering; Constituency parsing in Arabic; Tree-kernel-based ranking; Long short-term memory neural networks; Attention models;
D O I
10.1016/j.ipm.2017.07.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we focus on the problem of question ranking in community question answering (cQA) forums in Arabic. We address the task with machine learning algorithms using advanced Arabic text representations. The latter are obtained by applying tree kernels to constituency parse trees combined with textual similarities, including word embeddings. Our two main contributions are: (i) an Arabic language processing pipeline based on UIMA-from segmentation to constituency parsing-built on top of Farasa, a state-of-the-art Arabic language processing toolkit; and (ii) the application of long short-term memory neural networks to identify the best text fragments in questions to be used in our tree-kernel-based ranker. Our thorough experimentation on a recently released cQA dataset shows that the Arabic linguistic processing provided by Farasa produces strong results and that neural networks combined with tree kernels further boost the performance in terms of both efficiency and accuracy. Our approach also enables an implicit comparison between different processing pipelines as our tests on Farasa and Stanford parsers demonstrate. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:274 / 290
页数:17
相关论文
共 50 条
  • [21] Arabic Question Answering Using Ontology
    Albarghothi, Ali
    Khater, Feras
    Shaalan, Khaled
    ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 183 - 191
  • [22] Arabic question answering system: a survey
    Tahani H. Alwaneen
    Aqil M. Azmi
    Hatim A. Aboalsamh
    Erik Cambria
    Amir Hussain
    Artificial Intelligence Review, 2022, 55 : 207 - 253
  • [23] User authority ranking models for community question answering
    Raoa, Yanghui
    Xie, Haoran
    Liu, Xuebo
    Li, Qing
    Wang, Fu Lee
    Wong, Tak-Lam
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (05) : 2533 - 2542
  • [24] Arabic question answering system: a survey
    Alwaneen, Tahani H.
    Azmi, Aqil M.
    Aboalsamh, Hatim A.
    Cambria, Erik
    Hussain, Amir
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (01) : 207 - 253
  • [25] Enhanced Arabic Question Answering System
    Kamal, Aml Ibrahim
    Azim, Mostafa Abd
    Mahmoud, Mohamed
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 641 - 645
  • [26] VAQA: Visual Arabic Question Answering
    Kamel, Sarah M. M.
    Hassan, Shimaa I. I.
    Elrefaei, Lamiaa
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10803 - 10823
  • [27] VAQA: Visual Arabic Question Answering
    Sarah M. kamel
    Shimaa I. Hassan
    Lamiaa Elrefaei
    Arabian Journal for Science and Engineering, 2023, 48 : 10803 - 10823
  • [28] Learning Distributed Representations of Data in Community Question Answering for Question Retrieval
    Zhang, Kai
    Wu, Wei
    Wang, Fang
    Zhou, Ming
    Li, Zhoujun
    PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, : 533 - 542
  • [29] Using Domain Knowledge and Bilingual Resources for addressing Community Question Answering for Arabic
    El Adlouni, Yassine
    Lahbari, Imane
    Rodriguez, Horacio
    Meknassi, Mohammed
    El Alaoui, Said Ouatik
    Ennahnahi, Noureddine
    2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 368 - 373
  • [30] Efficient Question Answering Based on Language Models and Knowledge Graphs
    Li, Fengying
    Huang, Hongfei
    Dong, Rongsheng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 340 - 351