PIQA: Reasoning about Physical Commonsense in Natural Language

被引:0
|
作者
Bisk, Yonatan [1 ,2 ,3 ,4 ]
Zellers, Rowan [1 ,4 ]
Le Bras, Ronan [1 ]
Gao, Jianfeng [2 ]
Choi, Yejin [1 ,4 ]
机构
[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA
[2] Microsoft Res AI, Redmond, WA 98052 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To apply eyeshadow without a brush, should I use a cotton swab or a toothpick? Questions requiring this kind of physical commonsense pose a challenge to today's natural language understanding systems. While recent pretrained models (such as BERT) have made progress on question answering over more abstract domains - such as news articles and encyclopedia entries, where text is plentiful - in more physical domains, text is inherently limited due to reporting bias. Can AI systems learn to reliably answer physical commonsense questions without experiencing the physical world? In this paper, we introduce the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA. Though humans find the dataset easy (95% accuracy), large pretrained models struggle (similar to 75%). We provide analysis about the dimensions of knowledge that existing models lack, which offers significant opportunities for future research.
引用
收藏
页码:7432 / 7439
页数:8
相关论文
共 50 条
  • [41] Definability and commonsense reasoning
    Amati, G
    Aiello, LC
    Pirri, F
    ARTIFICIAL INTELLIGENCE, 1997, 93 (1-2) : 169 - 199
  • [42] ROCK(sic): Causal Inference Principles for Reasoning about Commonsense Causality
    Zhang, Jiayao
    Zhang, Hongming
    Su, Weijie J.
    Roth, Dan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [43] Approximate reasoning about natural language: A certain distributional-mereological model
    Polkowski, LT
    Semeniuk-Polkowska, M
    MATHEMATICAL AND COMPUTATIONAL ANALYSIS OF NATURAL LANGUAGE, 1998, 45 : 239 - 252
  • [44] Natural Language Reasoning, A Survey
    Yu, Fei
    Zhang, Hongbo
    Tiwari, Prayag
    Wang, Benyou
    ACM COMPUTING SURVEYS, 2024, 56 (12)
  • [45] Probabilistic reasoning and natural language
    Macchi, Laura
    Bagassi, Maria
    BIOLOGICAL AND CULTURAL BASES OF HUMAN INFERENCE, 2006, : 223 - 239
  • [46] Influence of Natural Language on Reasoning
    Skelac, Ines
    Smokrovic, Nenad
    FILOZOFSKA ISTRAZIVANJA, 2017, 37 (04): : 709 - 722
  • [47] Language-based reasoning graph neural network for commonsense question answering
    Yang, Meng
    Wang, Yihao
    Gu, Yu
    NEURAL NETWORKS, 2025, 181
  • [48] Vision-Language-Knowledge Co-Embedding for Visual Commonsense Reasoning
    Lee, JaeYun
    Kim, Incheol
    SENSORS, 2021, 21 (09)
  • [49] RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge
    Lin, Bill Yuchen
    Wu, Ziyi
    Yang, Yichi
    Lee, Dong-Ho
    Ren, Xiang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1504 - 1515
  • [50] Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
    Bhargava, Prajjwal
    Ng, Vincent
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12317 - 12325