PIQA: Reasoning about Physical Commonsense in Natural Language

被引:0
|
作者
Bisk, Yonatan [1 ,2 ,3 ,4 ]
Zellers, Rowan [1 ,4 ]
Le Bras, Ronan [1 ]
Gao, Jianfeng [2 ]
Choi, Yejin [1 ,4 ]
机构
[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA
[2] Microsoft Res AI, Redmond, WA 98052 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To apply eyeshadow without a brush, should I use a cotton swab or a toothpick? Questions requiring this kind of physical commonsense pose a challenge to today's natural language understanding systems. While recent pretrained models (such as BERT) have made progress on question answering over more abstract domains - such as news articles and encyclopedia entries, where text is plentiful - in more physical domains, text is inherently limited due to reporting bias. Can AI systems learn to reliably answer physical commonsense questions without experiencing the physical world? In this paper, we introduce the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA. Though humans find the dataset easy (95% accuracy), large pretrained models struggle (similar to 75%). We provide analysis about the dimensions of knowledge that existing models lack, which offers significant opportunities for future research.
引用
收藏
页码:7432 / 7439
页数:8
相关论文
共 50 条
  • [31] How to Use Language Expert to Assist Inference for Visual Commonsense Reasoning
    Song, Zijie
    Hu, Wenbo
    Ye, Hao
    Hong, Richang
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 521 - 527
  • [32] The Generalization and Robustness of Transformer-Based Language Models on Commonsense Reasoning
    Shen, Ke
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23419 - 23420
  • [33] Context-based commonsense reasoning in the DALI logic programming language
    Costantini, S
    Tocchio, A
    MODELING AND USING CONTEXT, PROCEEDINGS, 2003, 2680 : 443 - 450
  • [34] Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
    Storks, Shane
    Gao, Qiaozi
    Zhang, Yichi
    Chai, Joyce
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4902 - 4918
  • [35] ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
    Zhou, Kaiwen
    Lee, Kwonjoon
    Misu, Teruhisa
    Wang, Xin Eric
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 10783 - 10795
  • [36] Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language
    Wang, Yi
    Zhang, Shiqi
    Lee, Joohyung
    THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2019, 19 (5-6) : 1090 - 1106
  • [37] Confidences for Commonsense Reasoning
    Tammet, Tanel
    Draheim, Dirk
    Jarv, Priit
    AUTOMATED DEDUCTION, CADE 28, 2021, 12699 : 507 - 524
  • [38] Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph
    Ji, Haozhe
    Ke, Pei
    Huang, Shaohan
    Wei, Furu
    Zhu, Xiaoyan
    Huang, Minlie
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 725 - 736
  • [39] Commonsense Reasoning and Explainable Artificial Intelligence Using Large Language Models
    Krause, Stefanie
    Stolzenburg, Frieder
    ARTIFICIAL INTELLIGENCE-ECAI 2023 INTERNATIONAL WORKSHOPS, PT 1, XAI3, TACTIFUL, XI-ML, SEDAMI, RAAIT, AI4S, HYDRA, AI4AI, 2023, 2024, 1947 : 302 - 319
  • [40] Commonsense Reasoning and Commonsense Knowledge in Artificial Intelligence
    Davis, Ernest
    Marcus, Gary
    COMMUNICATIONS OF THE ACM, 2015, 58 (09) : 92 - 103