PIQA: Reasoning about Physical Commonsense in Natural Language

被引:0
|
作者
Bisk, Yonatan [1 ,2 ,3 ,4 ]
Zellers, Rowan [1 ,4 ]
Le Bras, Ronan [1 ]
Gao, Jianfeng [2 ]
Choi, Yejin [1 ,4 ]
机构
[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA
[2] Microsoft Res AI, Redmond, WA 98052 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To apply eyeshadow without a brush, should I use a cotton swab or a toothpick? Questions requiring this kind of physical commonsense pose a challenge to today's natural language understanding systems. While recent pretrained models (such as BERT) have made progress on question answering over more abstract domains - such as news articles and encyclopedia entries, where text is plentiful - in more physical domains, text is inherently limited due to reporting bias. Can AI systems learn to reliably answer physical commonsense questions without experiencing the physical world? In this paper, we introduce the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA. Though humans find the dataset easy (95% accuracy), large pretrained models struggle (similar to 75%). We provide analysis about the dimensions of knowledge that existing models lack, which offers significant opportunities for future research.
引用
收藏
页码:7432 / 7439
页数:8
相关论文
共 50 条
  • [21] Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning
    Lv, Changsheng
    Zhang, Shuai
    Tian, Yapeng
    Qi, Mengshi
    Ma, Huadong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [22] Commonsense reasoning
    Galitsky, Boris
    COMPUTATIONAL LINGUISTICS, 2007, 33 (01) : 145 - 146
  • [23] Commonsense reasoning about containers using radically incomplete information
    Davis, Ernest
    Marcus, Gary
    Frazier-Logue, Noah
    ARTIFICIAL INTELLIGENCE, 2017, 248 : 46 - 84
  • [24] COMMONSENSE REASONING ABOUT CAUSALITY - DERIVING BEHAVIOR FROM STRUCTURE
    KUIPERS, B
    ARTIFICIAL INTELLIGENCE, 1984, 24 (1-3) : 169 - 203
  • [25] A library of behaviors: Implementing commonsense reasoning about mental world
    Galitsky, B
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2004, 3215 : 307 - 313
  • [26] Reasoning about Actions and State Changes by Injecting Commonsense Knowledge
    Tandon, Niket
    Mishra, Bhavana Dalvi
    Grus, Joel
    Yih, Wen-tau
    Bosselut, Antoine
    Clark, Peter
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 57 - 66
  • [27] Temporal validity reassessment: commonsense reasoning about information obsoleteness
    Hosokawa, Taishi
    Jatowt, Adam
    Sugiyama, Kazunari
    DISCOVER COMPUTING, 2024, 27 (01)
  • [28] Natural (language) temporal logic: Reasoning about absolute and relative time
    Iwanska, L
    INTERNATIONAL JOURNAL OF EXPERT SYSTEMS, 1996, 9 (01): : 113 - 149
  • [29] MCOMET: Multimodal Fusion Transformer for Physical Audiovisual Commonsense Reasoning
    Zong, Daoming
    Sun, Shiliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 5, 2023, : 6621 - 6629
  • [30] Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
    Marasovic, Ana
    Bhagavatula, Chandra
    Park, Jae Sung
    Le Bras, Ronan
    Smith, Noah A.
    Choi, Yejin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2810 - 2829