TEMPORAL UNCERTAINTY OF REINFORCEMENT

被引:0
|
作者
ROYER, FL
机构
来源
PSYCHONOMIC SCIENCE | 1969年 / 15卷 / 05期
关键词
D O I
10.3758/BF03337417
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
引用
收藏
页码:269 / &
相关论文
共 50 条
  • [31] Uncertainty Aware Model Integration on Reinforcement Learning
    Nagata, Takashi
    Xing, Jinwei
    Kumazawa, Tsutomu
    Neftci, Emre
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [32] Discrete Uncertainty Quantification For Offline Reinforcement Learning
    Perez, Jose Luis
    Corrochano, Javier
    Garcia, Javier
    Majadas, Ruben
    Ibanez-Llano, Cristina
    Perez, Sergio
    Fernandez, Fernando
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 13 (04) : 273 - 287
  • [34] Uncertainty Propagation for Quality Assurance in Reinforcement Learning
    Schneegass, Daniel
    Udluft, Steffen
    Martinetz, Thomas
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2588 - 2595
  • [35] Reinforcement uncertainty enhances preference for choice in humans
    Rost, Kristen A.
    JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 2018, 110 (02) : 201 - 212
  • [36] Neurorobotic reinforcement learning for domains with parametrical uncertainty
    Amaya, Camilo
    von Arnim, Axel
    FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [37] Navigating Uncertainty in Epidemic Contexts with Reinforcement Learning
    Ondula, Elizabeth Akinyi
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23407 - 23408
  • [38] Reinforcement Learning With Temporal Logic Rewards
    Li, Xiao
    Vasile, Cristian-Ioan
    Belta, Calin
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3834 - 3839
  • [39] TEMPORAL CONTIGUITY - IS IT A SUFFICIENT CONDITION FOR REINFORCEMENT
    HARRISON, RG
    SCHAEFFER, RW
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1975, 5 (03) : 230 - 232
  • [40] Posterior Weighted Reinforcement Learning with State Uncertainty
    Larsen, Tobias
    Leslie, David S.
    Collins, Edmund J.
    Bogacz, Rafal
    NEURAL COMPUTATION, 2010, 22 (05) : 1149 - 1179