Aberrant reward learning, but not negative reinforcement learning, is related to depressive symptoms: an attentional perspective

被引:3
|
作者
Hertz-Palmor, Nimrod [1 ,2 ]
Rozenblit, Danielle [1 ]
Lavi, Shani [1 ]
Zeltser, Jonathan [1 ]
Kviatek, Yonatan [1 ]
Lazarov, Amit [1 ,3 ]
机构
[1] Tel Aviv Univ, Sch Psychol Sci, Tel Aviv, Israel
[2] Univ Cambridge, MRC Cognit & Brain Sci Unit, Cambridge, England
[3] Columbia Univ, Irving Med Ctr, Dept Psychiat, New York, NY 10027 USA
基金
以色列科学基金会;
关键词
anhedonia; attention allocation; depression symptoms; negative reinforcement; positive reinforcement; reward learning; selection history; SOCIAL ANXIETY DISORDER; VALUE-DRIVEN ATTENTION; EYE-TRACKING; BIAS MODIFICATION; INDIVIDUALS; ANHEDONIA; INFORMATION; RESPONSES; STIMULI; STRESS;
D O I
10.1017/S0033291723002519
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Background Aberrant reward functioning is implicated in depression. While attention precedes behavior and guides higher-order cognitive processes, reward learning from an attentional perspective - the effects of prior reward-learning on subsequent attention allocation - has been mainly overlooked.Methods The present study explored the effects of reward-based attentional learning in depression using two separate, yet complimentary, studies. In study 1, participants with high (HD) and low (LD) levels of depression symptoms were trained to divert their gaze toward one type of stimuli over another using a novel gaze-contingent music reward paradigm - music played when fixating the desired stimulus type and stopped when gazing the alternate one. Attention allocation was assessed before, during, and following training. In study 2, using negative reinforcement, the same attention allocation pattern was trained while substituting the appetitive music reward for gazing the desired stimulus type with the removal of an aversive sound (i.e. white noise).Results In study 1 both groups showed the intended shift in attention allocation during training (online reward learning), while generalization of learning at post-training was only evident among LD participants. Conversely, in study 2 both groups showed post-training generalization. Results were maintained when introducing anxiety as a covariate, and when using a more powerful sensitivity analysis. Finally, HD participants showed higher learning speed than LD participants during initial online learning, but only when using negative, not positive, reinforcement.Conclusions Deficient generalization of learning characterizes the attentional system of HD individuals, but only when using reward-based positive reinforcement, not negative reinforcement.
引用
收藏
页码:794 / 807
页数:14
相关论文
共 50 条
  • [31] Compatible Reward Inverse Reinforcement Learning
    Metelli, Alberto Maria
    Pirotta, Matteo
    Restelli, Marcello
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [32] Hierarchical average reward reinforcement learning
    Department of Computing Science, University of Alberta, Edmonton, Alta. T6G 2E8, Canada
    不详
    Journal of Machine Learning Research, 2007, 8 : 2629 - 2669
  • [33] Distributional Reward Decomposition for Reinforcement Learning
    Lin, Zichuan
    Zhao, Li
    Yang, Derek
    Qin, Tao
    Yang, Guangwen
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [34] Reward learning: Reinforcement, incentives, and expectations
    Berridge, KC
    PSYCHOLOGY OF LEARNING AND MOTIVATION: ADVANCES IN RESEARCH AND THEORY, VOL 40, 2001, 40 : 223 - 278
  • [35] Learning to attend and ignore: The influence of reward learning on attentional capture and suppression
    Pearson, Daniel
    Whitford, Thomas
    Le Pelley, Mike
    PERCEPTION, 2016, 45 : 356 - 356
  • [36] Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
    Gu, Shangding
    Sel, Bilgehan
    Ding, Yuhao
    Wang, Lu
    Lin, Qingwei
    Jin, Ming
    Knoll, Alois
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21099 - 21106
  • [37] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
    Everitt, Tom
    Hutter, Marcus
    Kumar, Ramana
    Krakovna, Victoria
    SYNTHESE, 2021, 198 (SUPPL 27) : 6435 - 6467
  • [38] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
    Tom Everitt
    Marcus Hutter
    Ramana Kumar
    Victoria Krakovna
    Synthese, 2021, 198 : 6435 - 6467
  • [39] Learning reward machines: A study in partially observable reinforcement learning 
    Icarte, Rodrigo Toro
    Klassen, Toryn Q.
    Valenzano, Richard
    Castro, Margarita P.
    Waldie, Ethan
    Mcilraith, Sheila A.
    ARTIFICIAL INTELLIGENCE, 2023, 323
  • [40] Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation
    Gao, Yang
    Meyer, Christian M.
    Mesgar, Mohsen
    Gurevych, Iryna
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2350 - 2356