Semantic-Based Data Augmentation for Math Word Problems

被引:2
|
作者
Li, Ailisi [1 ]
Xiao, Yanghua [1 ,2 ]
Liang, Jiaqing [1 ]
Chen, Yunwen [3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Data Sci, Shanghai, Peoples R China
[2] Fudan Aishu Cognit Intelligence Joint Res Ctr, Shanghai, Peoples R China
[3] DataGrand Inc, Shanghai, Peoples R China
关键词
Math word problem; Data augmentation;
D O I
10.1007/978-3-031-00129-1_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It's hard for neural MWP solvers to deal with tiny local variances. In MWP task, some local changes conserve the original semantic while the others may totally change the underlying logic. Currently, existing datasets for MWP task contain limited samples which are key for neural models to learn to disambiguate different kinds of local variances in questions and solve the questions correctly. In this paper, we propose a set of novel data augmentation approaches to supplement existing datasets with such data that are augmented with different kinds of local variances, and help to improve the generalization ability of current neural models. New samples are generated by knowledge guided entity replacement, and logic guided problem reorganization. The augmentation approaches are ensured to keep the consistency between the new data and their labels. Experimental results have shown the necessity and the effectiveness of our methods.
引用
收藏
页码:36 / 51
页数:16
相关论文
共 50 条
  • [21] Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
    Kumar, Vivek
    Maheshwary, Rishabh
    Pudi, Vikram
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4194 - 4206
  • [22] Math-word embedding in math search and semantic extraction
    André Greiner-Petter
    Abdou Youssef
    Terry Ruas
    Bruce R. Miller
    Moritz Schubotz
    Akiko Aizawa
    Bela Gipp
    Scientometrics, 2020, 125 : 3017 - 3046
  • [23] Recycling Numeracy Data Augmentation with Symbolic Verification for Math Word Problem Solving
    Jen, Tien-Yi
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 653 - 657
  • [24] Semantic-Based Web API Composition for Data Mashups
    Lee, Yong-Ju
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (04) : 1233 - 1248
  • [25] A semantic-based approach for Machine Learning data analysis
    Pinto, Agnese
    Scioscia, Floriano
    Loseto, Giuseppe
    Ruta, Michele
    Bove, Eliana
    Di Sciascio, Eugenio
    2015 IEEE 9TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2015, : 324 - 327
  • [26] Math-word embedding in math search and semantic extraction
    Greiner-Petter, Andre
    Youssef, Abdou
    Ruas, Terry
    Miller, Bruce R.
    Schubotz, Moritz
    Aizawa, Akiko
    Gipp, Bela
    SCIENTOMETRICS, 2020, 125 (03) : 3017 - 3046
  • [27] CSONG:A Semantic-based Hybrid Overlay Network for Data Grid
    Li, Kan
    Zheng, Zhikun
    Xing, Xiaoxing
    Sun, Xin
    GCC 2008: Seventh International Conference on Grid and Cooperative Computing, Proceedings, 2008, : 266 - 272
  • [28] Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving
    Yigit G.
    Amasyali M.F.
    SN Computer Science, 5 (5)
  • [29] ChemOnt: A semantic-based ontology for chemical and biological data integration
    Feunang, Yannick Djoumbou
    Wishart, David S.
    Karu, Naama
    Marcu, Ana
    Lo, Elvis
    Guo, An Chi
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [30] SEiS: A semantic-based system for integrating building energy data
    Madrazo, L.
    Massetti, M.
    Sicilia, A.
    Wadel, G.
    Ianni, M.
    INFORMES DE LA CONSTRUCCION, 2015, 67 (537)