Template-Based Math Word Problem Solvers with Recursive Neural Networks

被引:0
|
作者
Wang, Lei [1 ,2 ]
Zhang, Dongxiang [1 ,2 ,3 ]
Zhang, Jipeng [1 ,2 ]
Xu, Xing [1 ,2 ,3 ]
Gao, Lianli [1 ,2 ]
Dai, Bing Tian [4 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] UESTC, Ctr Future Media, Chengdu, Sichuan, Peoples R China
[2] UESTC, Sch Comp Sci & Engn, Chengdu, Sichuan, Peoples R China
[3] Afanti Res, Chengdu, Sichuan, Peoples R China
[4] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of automatic solvers to arithmetic math word problems has attracted considerable attention in recent years and a large number of datasets and methods have been published. Among them, Math23K is the largest data corpus that is very helpful to evaluate the generality and robustness of a proposed solution. The best performer in Math23K is a seq2seq model based on LSTM to generate the math expression. However, the model suffers from performance degradation in large space of target expressions. In this paper, we propose a template-based solution based on recursive neural network for math expression construction. More specifically, we first apply a seq2seq model to predict a tree-structure template, with inferred numbers as leaf nodes and unknown operators as inner nodes. Then, we design a recursive neural network to encode the quantity with Bi-LSTM and self attention, and infer the unknown operator nodes in a bottom-up manner. The experimental results clearly establish the superiority of our new framework as we improve the accuracy by a wide margin in two of the largest datasets, i.e., from 58.1% to 66.9% in Math23K and from 62.8% to 66.8% in MAWPS.
引用
收藏
页码:7144 / 7151
页数:8
相关论文
共 50 条
  • [1] Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving
    Qin, Jinghui
    Yang, Zhicheng
    Chen, Jiaqi
    Liang, Xiaodan
    Lin, Liang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12823 - 12835
  • [2] Template-based Question Answering using Recursive Neural Networks
    Athreya, Ram G.
    Bansal, Srividya K.
    Ngomo, Axel-Cyrille Ngonga
    Usbeck, Ricardo
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 195 - 198
  • [3] Generalizing Math Word Problem Solvers via Solution Diversification
    Liang, Zhenwen
    Zhang, Jipeng
    Wang, Lei
    Wang, Yan
    Shao, Jie
    Zhang, Xiangliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13183 - 13191
  • [4] An Introspective Data Augmentation Method for Training Math Word Problem Solvers
    Qin, Jinghui
    Huang, Zhongzhan
    Zeng, Ying
    Zhang, Quanshi
    Lin, Liang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3113 - 3127
  • [5] A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
    Miao, Shen-Yun
    Liang, Chao-Chun
    Su, Keh-Yih
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 975 - 984
  • [6] The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers
    Zhang, Dongxiang
    Wang, Lei
    Zhang, Luming
    Dai, Bing Tian
    Shen, Heng Tao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (09) : 2287 - 2305
  • [7] Comparative study of typical neural solvers in solving math word problems
    He, Bin
    Yu, Xinguo
    Huang, Litian
    Meng, Hao
    Liang, Guanghua
    Chen, Shengnan
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5805 - 5830
  • [8] Template-based Neural Program Repair
    Meng, Xiangxin
    Wang, Xu
    Zhang, Hongyu
    Sun, Hailong
    Liu, Xudong
    Hu, Chunming
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 1456 - 1468
  • [9] Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
    Liang, Zhenwen
    Yu, Wenhao
    Rajpurohit, Tanmay
    Clark, Peter
    Zhang, Xiangliang
    Kalyan, Ashwin
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 14384 - 14396
  • [10] Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
    Walsh, Ian
    Bau, Davide
    Martin, Alberto J. M.
    Mooney, Catherine
    Vullo, Alessandro
    Pollastri, Gianluca
    BMC STRUCTURAL BIOLOGY, 2009, 9