Numerical reasoning in machine reading comprehension tasks: are we there yet?

被引：0

作者：

Al-Negheimish, Hadeel ^{[1
]}

Madhyastha, Pranava ^{[1
,2
]}

Russo, Alessandra ^{[1
]}

机构：

[1] Imperial Coll London, London, England

[2] City Univ London, London, England

来源：

2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Numerical reasoning based machine reading comprehension is a task that involves reading comprehension along with using arithmetic operations such as addition, subtraction, sorting, and counting. The DROP benchmark (Dua et al., 2019) is a recent dataset that has inspired the design of NLP models aimed at solving this task. The current standings of these models in the DROP leaderboard, over standard metrics, suggest that the models have achieved near-human performance. However, does this mean that these models have learned to reason? In this paper, we present a controlled study on some of the top-performing model architectures for the task of numerical reasoning. Our observations suggest that the standard metrics are incapable of measuring progress towards such tasks.

引用

页码：9643 / 9649

页数：7

共 50 条

[1] NumNet: Machine Reading Comprehension with Numerical Reasoning
Ran, Qiu
Lin, Yankai
Li, Peng
Zhou, Jie
Liu, Zhiyuan
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2474 - 2484
[2] Fact -Driven Logical Reasoning for Machine Reading Comprehension
Ouyang, Siru
Zhang, Zhuosheng
Zhao, Hai
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18851 - 18859
[3] Interpretable modular knowledge reasoning for machine reading comprehension
Mucheng Ren
Heyan Huang
Yang Gao
Neural Computing and Applications, 2022, 34 : 9901 - 9918
[4] Interpretable modular knowledge reasoning for machine reading comprehension
Ren, Mucheng
Huang, Heyan
Gao, Yang
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12): : 9901 - 9918
[5] LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning
Liu, Jian
Cui, Leyang
Liu, Hanmeng
Huang, Dandan
Wang, Yile
Zhang, Yue
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3622 - 3628
[6] COSMOS QA: Machine Reading Comprehension with Contextual Commonsense Reasoning
Huang, Lifu
Le Bras, Ronan
Bhagavatula, Chandra
Choi, Yejin
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2391 - 2401
[7] Machine Reading Comprehension using Case-based Reasoning
Thai, Dung
Agarwal, Dhruv
Chaudhary, Mudit
Zhao, Wenlong
Das, Rajarshi
Zaheer, Manzil
Lee, Jay-Yoon
Hajishirzi, Hannaneh
McCallum, Andrew
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8414 - 8428
[8] Plug-and-Play Module for Commonsense Reasoning in Machine Reading Comprehension
Dai, Damai
Zheng, Hua
Sui, Zhifang
Chang, Baobao
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 29 - 41
[9] ESTER: A Machine Reading Comprehension Dataset for Reasoning about Event Semantic
Han, Rujun
Hsu, I-Hung
Sun, Jiao
Baylon, Julia
Ning, Qiang
Roth, Dan
Peng, Nanyun
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7543 - 7559
[10] The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks
Strassel, Stephanie
Adams, Dan
Goldberg, Henry
Herr, Jonathan
Keesing, Ron
Oblinger, Daniel
Simpson, Heather
Schrag, Robert
Wright, Jonathan
LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,

← 1 2 3 4 5 →