Path Spuriousness-aware Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning

被引:0
|
作者
Jiang, Chunyang [1 ,2 ]
Zhu, Tianchen [1 ,2 ,4 ]
Zhou, Haoyi [1 ,3 ]
Liu, Chang [3 ]
Deng, Ting [1 ,2 ]
Hu, Chunming [1 ,3 ]
Li, Jianxin [1 ,2 ]
机构
[1] Beihang Univ, SKLSDE, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[3] Beihang Univ, Sch Software, Beijing, Peoples R China
[4] Beihang Univ, Shenyuan Honors Coll, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-hop reasoning, a prevalent approach for query answering, aims at inferring new facts along reasonable paths over a knowledge graph. Reinforcement learning (RL) methods can be adopted by formulating the problem into a Markov decision process. However, common suffering within RL-based reasoning models is that the agent can be biased to spurious paths which coincidentally lead to the correct answer with poor explanation. In this work, we take a deep dive into this phenomenon and define a metric named Path Spuriousness (PS), to quantitatively estimate to what extent a path is spurious. Guided by the definition of PS, we design a model with a new reward that considers both answer accuracy and path reasonableness. We test our method on five datasets and experiments reveal that our method considerably enhances the agent's capacity to prevent spurious paths while keeping comparable to state-of-the-art performance.
引用
收藏
页码:3181 / 3192
页数:12
相关论文
共 50 条
  • [31] SRGCN: Graph-based multi-hop reasoning on knowledge graphs
    Wang, Zikang
    Li, Linjing
    Zeng, Daniel
    NEUROCOMPUTING, 2021, 454 : 280 - 290
  • [32] Multi-hop path reasoning of temporal knowledge graphs based on generative adversarial imitation learning
    Bai, Luyi
    Xiao, Qianwen
    Zhu, Lin
    KNOWLEDGE-BASED SYSTEMS, 2025, 316
  • [33] Rule-Aware Reinforcement Learning for Knowledge Graph Reasoning
    Hou, Zhongni
    Jin, Xiaolong
    Li, Zixuan
    Bai, Long
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4687 - 4692
  • [34] HSMH: A Hierarchical Sequence Multi-Hop Reasoning Model With Reinforcement Learning
    Wang, Dan
    Li, Bo
    Song, Bin
    Chen, Chen
    Yu, F. Richard
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (04) : 1638 - 1649
  • [35] Incorporating anticipation embedding into reinforcement learning framework for multi-hop knowledge graph question answering
    Cui, Hai
    Peng, Tao
    Xiao, Feng
    Han, Jiayu
    Han, Ridong
    Liu, Lu
    INFORMATION SCIENCES, 2023, 619 : 745 - 761
  • [36] Reinforcement learning with dynamic completion for answering multi-hop questions over incomplete knowledge graph
    Cui, Hai
    Peng, Tao
    Han, Ridong
    Zhu, Beibei
    Bi, Haijia
    Liu, Lu
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [37] Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering
    Feng, Yanlin
    Chen, Xinyue
    Lin, Bill Yuchen
    Wang, Peifeng
    Yan, Jun
    Ren, Xiang
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1295 - 1309
  • [38] Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-Aware Query Representation Learning
    Kim, Jeonghoon
    Jung, Heesoo
    Jang, Hyeju
    Park, Hogun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 15978 - 15991
  • [39] ConvHiA: convolutional network with hierarchical attention for knowledge graph multi-hop reasoning
    Dengao Li
    Shuyi Miao
    Baofeng Zhao
    Yu Zhou
    Ding Feng
    Jumin Zhao
    Xupeng Niu
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2301 - 2315
  • [40] ConvHiA: convolutional network with hierarchical attention for knowledge graph multi-hop reasoning
    Li, Dengao
    Miao, Shuyi
    Zhao, Baofeng
    Zhou, Yu
    Feng, Ding
    Zhao, Jumin
    Niu, Xupeng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2301 - 2315