Modeling the formation of social conventions from embodied real-time interactions

被引:7
|
作者
Freire, Ismael T. [1 ]
Moulin-Frier, Clement [2 ,3 ]
Sanchez-Fibla, Marti [4 ]
Arsiwalla, Xerxes D. [1 ]
Verschure, Paul F. M. J. [1 ,5 ,6 ]
机构
[1] Inst Bioengn Catalonia, SPECS Lab, Barcelona, Spain
[2] INRIA, Flowers Team, Bordeaux, France
[3] Ensta ParisTech, Bordeaux, France
[4] Univ Pompeu Fabra, ETIC, AI ML Grp, Barcelona, Spain
[5] Barcelona Inst Sci & Technol BIST, Barcelona, Spain
[6] Catalan Inst Res & Adv Studies ICREA, Barcelona, Spain
来源
PLOS ONE | 2020年 / 15卷 / 06期
基金
欧盟地平线“2020”;
关键词
COMMUNICATION; COOPERATION; ARCHITECTURE; RECIPROCITY; EVOLUTION; FRAMEWORK; SYSTEMS; BRAIN; GAMES;
D O I
10.1371/journal.pone.0234434
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
What is the role of real-time control and learning in the formation of social conventions? To answer this question, we propose a computational model that matches human behavioral data in a social decision-making game that was analyzed both in discrete-time and continuous-time setups. Furthermore, unlike previous approaches, our model takes into account the role of sensorimotor control loops in embodied decision-making scenarios. For this purpose, we introduce the Control-based Reinforcement Learning (CRL) model. CRL is grounded in the Distributed Adaptive Control (DAC) theory of mind and brain, where low-level sensorimotor control is modulated through perceptual and behavioral learning in a layered structure. CRL follows these principles by implementing a feedback control loop handling the agent's reactive behaviors (pre-wired reflexes), along with an Adaptive Layer that uses reinforcement learning to maximize long-term reward. We test our model in a multi-agent game-theoretic task in which coordination must be achieved to find an optimal solution. We show that CRL is able to reach human-level performance on standard game-theoretic metrics such as efficiency in acquiring rewards and fairness in reward distribution.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] REAL-TIME MODELING OF POWER NETWORKS
    BOSE, A
    CLEMENTS, KA
    PROCEEDINGS OF THE IEEE, 1987, 75 (12) : 1607 - 1622
  • [32] Real-time modeling with architectural space
    Kurmann, D
    Elte, N
    Engeli, M
    CAAD FUTURES 1997, 1997, : 809 - 819
  • [33] Real-time furnace modeling and diagnostics
    Wang, JX
    Spanos, CJ
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2002, 15 (04) : 393 - 403
  • [34] Real-time ocean modeling systems
    Wallcraft, AJ
    Hurlburt, HE
    Metzger, EJ
    Rhodes, RC
    Shriver, JF
    Smedstad, OM
    COMPUTING IN SCIENCE & ENGINEERING, 2002, 4 (02) : 50 - 57
  • [35] Modeling a complex real-time system
    Happonen, Ari
    Porras, Jari
    4TH INTERNATIONAL INDUSTRIAL SIMULATION CONFERENCE 2006, 2006, : 175 - +
  • [36] AUTOMATA FOR MODELING REAL-TIME SYSTEMS
    ALUR, R
    DILL, D
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 443 : 322 - 335
  • [37] A Real-time Modeling of Photovoltaic Array
    Wang Wei
    Li Ning
    Li Shaoyuan
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2012, 20 (06) : 1154 - 1160
  • [38] Modeling and Estimation for Real-Time Microarrays
    Vikalo, Haris
    Hassibi, Babak
    Hassibi, Arjang
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2008, 2 (03) : 286 - 296
  • [39] Modeling and real-time simulation of UPFC
    Kimura, M
    Takahashi, C
    Kishibe, H
    Miyazaki, Y
    Noro, Y
    Iio, N
    ELECTRICAL ENGINEERING IN JAPAN, 2006, 155 (01) : 19 - 26
  • [40] Modeling fibril fragmentation in real-time
    Tan, Pengzhen
    Hong, Liu
    JOURNAL OF CHEMICAL PHYSICS, 2013, 139 (08):