PIQA: Reasoning about Physical Commonsense in Natural Language

被引:0
|
作者
Bisk, Yonatan [1 ,2 ,3 ,4 ]
Zellers, Rowan [1 ,4 ]
Le Bras, Ronan [1 ]
Gao, Jianfeng [2 ]
Choi, Yejin [1 ,4 ]
机构
[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA
[2] Microsoft Res AI, Redmond, WA 98052 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To apply eyeshadow without a brush, should I use a cotton swab or a toothpick? Questions requiring this kind of physical commonsense pose a challenge to today's natural language understanding systems. While recent pretrained models (such as BERT) have made progress on question answering over more abstract domains - such as news articles and encyclopedia entries, where text is plentiful - in more physical domains, text is inherently limited due to reporting bias. Can AI systems learn to reliably answer physical commonsense questions without experiencing the physical world? In this paper, we introduce the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA. Though humans find the dataset easy (95% accuracy), large pretrained models struggle (similar to 75%). We provide analysis about the dimensions of knowledge that existing models lack, which offers significant opportunities for future research.
引用
收藏
页码:7432 / 7439
页数:8
相关论文
共 50 条
  • [1] Commonsense reasoning in and over natural language
    Liu, H
    Singh, P
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2004, 3215 : 293 - 306
  • [2] Commonsense reasoning about the physical world
    Bliss, Joan
    STUDIES IN SCIENCE EDUCATION, 2008, 44 (02) : 123 - 155
  • [3] A commonsense language for reasoning about causation and rational action
    Ortiz, CL
    ARTIFICIAL INTELLIGENCE, 1999, 111 (1-2) : 73 - 130
  • [4] Commonsense language for reasoning about causation and rational action
    Ortiz Jr., Charles L.
    Artificial Intelligence, 1999, 111 (01): : 73 - 130
  • [5] Analogical Chaining with Natural Language Instruction for Commonsense Reasoning
    Blass, Joseph A.
    Forbus, Kenneth D.
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4357 - 4363
  • [6] CommonsenseVIS: Visualizing and Understanding Commonsense Reasoning Capabilities of Natural Language Models
    Wang, Xingbo
    Huang, Renfei
    Jin, Zhihua
    Fang, Tianqing
    Qu, Huamin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 273 - 283
  • [7] Commonsense reasoning about processes with RAP
    Arana, I
    MODELLING AND SIMULATION 1996, 1996, : 649 - 653
  • [8] Enabling Robots to Understand Incomplete Natural Language Instructions Using Commonsense Reasoning
    Chen, Haonan
    Tan, Hao
    Kuntz, Alan
    Bansal, Mohit
    Alterovitz, Ron
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 1963 - 1969
  • [9] Reasoning about inconsistencies in natural language requirements
    Gervasi, V
    Zowghi, D
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2005, 14 (03) : 277 - 330
  • [10] Psycholinguistic Diagnosis of Language Models' Commonsense Reasoning
    Cong, Yan
    PROCEEDINGS OF THE FIRST WORKSHOP ON COMMONSENSE REPRESENTATION AND REASONING (CSRR 2022), 2022, : 17 - 22