Numerical reasoning in machine reading comprehension tasks: are we there yet?

被引:0
|
作者
Al-Negheimish, Hadeel [1 ]
Madhyastha, Pranava [1 ,2 ]
Russo, Alessandra [1 ]
机构
[1] Imperial Coll London, London, England
[2] City Univ London, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerical reasoning based machine reading comprehension is a task that involves reading comprehension along with using arithmetic operations such as addition, subtraction, sorting, and counting. The DROP benchmark (Dua et al., 2019) is a recent dataset that has inspired the design of NLP models aimed at solving this task. The current standings of these models in the DROP leaderboard, over standard metrics, suggest that the models have achieved near-human performance. However, does this mean that these models have learned to reason? In this paper, we present a controlled study on some of the top-performing model architectures for the task of numerical reasoning. Our observations suggest that the standard metrics are incapable of measuring progress towards such tasks.
引用
收藏
页码:9643 / 9649
页数:7
相关论文
共 50 条
  • [1] NumNet: Machine Reading Comprehension with Numerical Reasoning
    Ran, Qiu
    Lin, Yankai
    Li, Peng
    Zhou, Jie
    Liu, Zhiyuan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2474 - 2484
  • [2] Fact -Driven Logical Reasoning for Machine Reading Comprehension
    Ouyang, Siru
    Zhang, Zhuosheng
    Zhao, Hai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18851 - 18859
  • [3] Interpretable modular knowledge reasoning for machine reading comprehension
    Mucheng Ren
    Heyan Huang
    Yang Gao
    Neural Computing and Applications, 2022, 34 : 9901 - 9918
  • [4] Interpretable modular knowledge reasoning for machine reading comprehension
    Ren, Mucheng
    Huang, Heyan
    Gao, Yang
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12): : 9901 - 9918
  • [5] LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning
    Liu, Jian
    Cui, Leyang
    Liu, Hanmeng
    Huang, Dandan
    Wang, Yile
    Zhang, Yue
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3622 - 3628
  • [6] COSMOS QA: Machine Reading Comprehension with Contextual Commonsense Reasoning
    Huang, Lifu
    Le Bras, Ronan
    Bhagavatula, Chandra
    Choi, Yejin
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2391 - 2401
  • [7] Machine Reading Comprehension using Case-based Reasoning
    Thai, Dung
    Agarwal, Dhruv
    Chaudhary, Mudit
    Zhao, Wenlong
    Das, Rajarshi
    Zaheer, Manzil
    Lee, Jay-Yoon
    Hajishirzi, Hannaneh
    McCallum, Andrew
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8414 - 8428
  • [8] Plug-and-Play Module for Commonsense Reasoning in Machine Reading Comprehension
    Dai, Damai
    Zheng, Hua
    Sui, Zhifang
    Chang, Baobao
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 29 - 41
  • [9] ESTER: A Machine Reading Comprehension Dataset for Reasoning about Event Semantic
    Han, Rujun
    Hsu, I-Hung
    Sun, Jiao
    Baylon, Julia
    Ning, Qiang
    Roth, Dan
    Peng, Nanyun
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7543 - 7559
  • [10] The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks
    Strassel, Stephanie
    Adams, Dan
    Goldberg, Henry
    Herr, Jonathan
    Keesing, Ron
    Oblinger, Daniel
    Simpson, Heather
    Schrag, Robert
    Wright, Jonathan
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,