Semantic-Based Data Augmentation for Math Word Problems

Cited by: 2
Authors
Li, Ailisi [1]
Xiao, Yanghua [1,2]
Liang, Jiaqing [1]
Chen, Yunwen [3]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Data Sci, Shanghai, Peoples R China
[2] Fudan Aishu Cognit Intelligence Joint Res Ctr, Shanghai, Peoples R China
[3] DataGrand Inc, Shanghai, Peoples R China
Keywords
Math word problem; Data augmentation
DOI
10.1007/978-3-031-00129-1_3
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Neural math word problem (MWP) solvers struggle with small local variations in problem text. In the MWP task, some local changes preserve the original semantics, while others completely change the underlying logic. Existing MWP datasets contain few of the samples that neural models need in order to learn to distinguish these kinds of local variation and solve the questions correctly. In this paper, we propose a set of novel data augmentation approaches that supplement existing datasets with samples exhibiting different kinds of local variation, helping to improve the generalization ability of current neural models. New samples are generated by knowledge-guided entity replacement and logic-guided problem reorganization. The augmentation approaches ensure that the new data remain consistent with their labels. Experimental results show the necessity and effectiveness of our methods.
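As a rough illustration of the entity-replacement idea mentioned in the abstract, the sketch below swaps an entity for another of the same category and reuses the equation label unchanged. It is not the authors' implementation: the `SAME_CATEGORY` table, the function names, and the sample problem are all hypothetical, and the table merely stands in for whatever knowledge source guides the replacement.

```python
import re

# Hypothetical stand-in for an external knowledge source:
# each entity maps to substitutes assumed to be of the same category.
SAME_CATEGORY = {
    "apples": ["pears", "oranges"],
    "pencils": ["pens", "erasers"],
}


def replace_entity(question, equation, entity, substitute):
    """Swap every whole-word occurrence of `entity` in the question.

    The equation is returned unchanged, on the assumption that a
    same-category swap does not alter the underlying arithmetic.
    """
    pattern = re.compile(rf"\b{re.escape(entity)}\b")
    return pattern.sub(substitute, question), equation


def augment(question, equation):
    """Return one (question, equation) pair per applicable substitute."""
    samples = []
    for entity, substitutes in SAME_CATEGORY.items():
        if re.search(rf"\b{re.escape(entity)}\b", question):
            for substitute in substitutes:
                samples.append(
                    replace_entity(question, equation, entity, substitute)
                )
    return samples


if __name__ == "__main__":
    q = "Tom has 3 apples and buys 5 apples. How many apples does he have?"
    for new_q, eq in augment(q, "x = 3 + 5"):
        print(new_q, "|", eq)
```

Every occurrence of the entity is replaced together, so the question stays internally consistent and the original label still applies.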
Pages: 36-51
Number of pages: 16