Keep Skills in Mind: Understanding and Implementing Skills in Commonsense Question Answering

被引:0
|
作者
Bao, Meikai [1 ,2 ]
Liu, Qi [1 ,2 ]
Zhang, Kai [1 ,2 ]
Liu, Ye [1 ,2 ]
Yue, Linan [1 ,2 ]
Li, Longfei [3 ]
Zhou, Jun [3 ]
机构
[1] Univ Sci & Technol China, Anhui Prov Key Lab Big Data Anal & Applicat, Hefei, Peoples R China
[2] State Key Lab Cognit Intelligence, Beijing, Peoples R China
[3] Ant Financial Serv Grp, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Commonsense Question Answering (CQA) aims to answer questions that require human commonsense. Closed-book CQA, as one of the subtasks, requires the model to answer questions without retrieving external knowledge, which emphasizes the importance of the model's problem-solving ability. Most previous methods relied on large-scale pre-trained models to generate question-related knowledge while ignoring the crucial role of skills in the process of answering commonsense questions. Generally, skills refer to the learned ability in performing a specific task or activity, which are derived from knowledge and experience. In this paper, we introduce a new approach named Dynamic Skill-aware Commonsense Question Answering (DSCQA), which transcends the limitations of traditional methods by informing the model about the need for each skill in questions and utilizes skills as a critical driver in CQA process. To be specific, DSCQA first employs commonsense skill extraction module to generate various skill representations. Then, DSCQA utilizes dynamic skill module to generate dynamic skill representations. Finally, in perception and emphasis module, various skills and dynamic skill representations are used to help question-answering process. Experimental results on two publicly available CQA datasets show the effectiveness of our proposed model and the considerable impact of introducing skills.
引用
收藏
页码:5012 / 5020
页数:9
相关论文
共 50 条
  • [41] Heterogeneous-Graph Reasoning With Context Paraphrase for Commonsense Question Answering
    Wang, Yujie
    Zhang, Hu
    Liang, Jiye
    Li, Ru
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3759 - 3770
  • [42] Knowledge-aware adaptive graph network for commonsense question answering
    Kang, Long
    Li, Xiaoge
    An, Xiaochun
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (05) : 1305 - 1324
  • [43] VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge
    Ravi, Sahithya
    Chinchure, Aditya
    Sigal, Leonid
    Liao, Renjie
    Shwartz, Vered
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1155 - 1165
  • [44] Improving Question Answering by Commonsense-Based Pre-training
    Zhong, Wanjun
    Tang, Duyu
    Duan, Nan
    Zhou, Ming
    Wang, Jiahai
    Yin, Jian
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 16 - 28
  • [45] Implicit Relation Inference with Deep Path Extraction for Commonsense Question Answering
    Yang, Peng
    Liu, Zijian
    Li, Bing
    Zhang, Penghui
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 4751 - 4768
  • [46] Meta-path reasoning of knowledge graph for commonsense question answering
    Zhang, Miao
    He, Tingting
    Dong, Ming
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (01)
  • [47] KEPR: Knowledge Enhancement and Plausibility Ranking for Generative Commonsense Question Answering
    Li, Zhifeng
    Zou, Bowei
    Fan, Yifan
    Hong, Yu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [48] Retrieval-Augmented Knowledge Graph Reasoning for Commonsense Question Answering
    Sha, Yuchen
    Feng, Yujian
    He, Miao
    Liu, Shangdong
    Ji, Yimu
    MATHEMATICS, 2023, 11 (15)
  • [49] Implicit Relation Inference with Deep Path Extraction for Commonsense Question Answering
    Peng Yang
    Zijian Liu
    Bing Li
    Penghui Zhang
    Neural Processing Letters, 2022, 54 : 4751 - 4768
  • [50] Language skills and metalinguistic skills, theory of the mind and development of literacy
    Mélançon, J
    Ziarko, H
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 149 - 149