Language processing and learning models for community question answering in Arabic

Cited by: 16
Authors
Romeo, Salvatore [1]
Da San Martino, Giovanni [1]
Belinkov, Yonatan [2]
Barron-Cedeno, Alberto [1]
Eldesouki, Mohamed [1]
Darwish, Kareem [1]
Mubarak, Hamdy [1]
Glass, James [2]
Moschitti, Alessandro [1]
Affiliations
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Keywords
Community question answering; Constituency parsing in Arabic; Tree-kernel-based ranking; Long short-term memory neural networks; Attention models
DOI
10.1016/j.ipm.2017.07.003
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we focus on the problem of question ranking in community question answering (cQA) forums in Arabic. We address the task with machine learning algorithms using advanced Arabic text representations. The latter are obtained by applying tree kernels to constituency parse trees combined with textual similarities, including word embeddings. Our two main contributions are: (i) an Arabic language processing pipeline based on UIMA (from segmentation to constituency parsing) built on top of Farasa, a state-of-the-art Arabic language processing toolkit; and (ii) the application of long short-term memory neural networks to identify the best text fragments in questions to be used in our tree-kernel-based ranker. Our thorough experimentation on a recently released cQA dataset shows that the Arabic linguistic processing provided by Farasa produces strong results and that neural networks combined with tree kernels further boost the performance in terms of both efficiency and accuracy. Our approach also enables an implicit comparison between different processing pipelines, as our tests on Farasa and Stanford parsers demonstrate. (C) 2017 Elsevier Ltd. All rights reserved.
Pages: 274-290
Page count: 17