Applying deep matching networks to Chinese medical question answering: a study and a dataset

被引:19
|
作者
He, Junqing [1 ,2 ]
Fu, Mingming [1 ,2 ]
Tu, Manshu [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical question answering; Chinese word segmentation; Semantic matching; Convolutional neural networks; Deep learning;
D O I
10.1186/s12911-019-0761-8
中图分类号
R-058 [];
学科分类号
摘要
BackgroundMedical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward. It can be attributed to the difficulty of Chinese text processing and the lack of large-scale datasets. To bridge the gap, this paper introduces a Chinese medical QA dataset and proposes effective methods for the task.MethodsWe first construct a large scale Chinese medical QA dataset. Then we leverage deep matching neural networks to capture semantic interaction between words in questions and answers. Considering that Chinese Word Segmentation (CWS) tools may fail to identify clinical terms, we design a module to merge the word segments and produce a new representation. It learns the common compositions of words or segments by using convolutional kernels and selects the strongest signals by windowed pooling.ResultsThe best performer among popular CWS tools on our dataset is found. In our experiments, deep matching models substantially outperform existing methods. Results also show that our proposed semantic clustered representation module improves the performance of models by up to 5.5% Precision at 1 and 4.9% Mean Average Precision.ConclusionsIn this paper, we introduce a large scale Chinese medical QA dataset and cast the task into a semantic matching problem. We also compare different CWS tools and input units. Among the two state-of-the-art deep matching neural networks, MatchPyramid performs better. Results also show the effectiveness of the proposed semantic clustered representation module.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] PLMVQA: Applying Pseudo Labels for Medical Visual Question Answering with Limited Data
    Yu, Zheng
    Xie, Yutong
    Xia, Yong
    Wu, Qi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2023 WORKSHOPS, 2023, 14394 : 357 - 367
  • [22] MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset
    Li, Jing
    Zhong, Shangping
    Chen, Kaizhi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8862 - 8874
  • [23] A Deep Transfer Learning Method for Medical Question Matching
    Shen, Yedan
    Huang, Xiaowei
    Tang, Buzhou
    Wang, Xiaolong
    Chen, Qingcai
    Ni, Yuan
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 515 - 516
  • [24] A Study of Deep Learning for Factoid Question Answering System
    Day, Min-Yuh
    Kuo, Yu-Ling
    2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020), 2020, : 419 - 424
  • [25] Deep Fuzzy Multiteacher Distillation Network for Medical Visual Question Answering
    Liu, Yishu
    Chen, Bingzhi
    Wang, Shuihua
    Lu, Guangming
    Zhang, Zheng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (10) : 5413 - 5427
  • [26] Convolutional Deep Neural Networks for Document-Based Question Answering
    Fu, Jian
    Qiu, Xipeng
    Huang, Xuanjing
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 790 - 797
  • [27] Deep Modular Co-Attention Networks for Visual Question Answering
    Yu, Zhou
    Yu, Jun
    Cui, Yuhao
    Tao, Dacheng
    Tian, Qi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6274 - 6283
  • [28] Relevant Question Answering in Community Based Networks Using Deep LSTM Neural Networks
    Karimi, Elaheh
    Majidi, Babak
    Manzuri, Mohammad Taghi
    2019 7TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2019, : 36 - 40
  • [29] EQUALS: A Real-world Dataset for Legal Question Answering via Reading Chinese Laws
    Chen, Andong
    Yao, Feng
    Zhao, Xinyan
    Zhang, Yating
    Sun, Changlong
    Liu, Yun
    Shen, Weixing
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023, 2023, : 71 - 80
  • [30] Sentence Answer Selection for Open Domain Question Answering via Deep Word Matching
    Ghigi, Fabrizio
    Turcsany, Diana
    Kaltenbrunner, Thomas
    Cibelli, Maurizio
    HUMAN LANGUAGE TECHNOLOGY. CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, LTC 2017, 2020, 12598 : 291 - 303