Aberrant reward learning, but not negative reinforcement learning, is related to depressive symptoms: an attentional perspective

被引：3

作者：

Hertz-Palmor, Nimrod ^{[1
,2
]}

Rozenblit, Danielle ^{[1
]}

Lavi, Shani ^{[1
]}

Zeltser, Jonathan ^{[1
]}

Kviatek, Yonatan ^{[1
]}

Lazarov, Amit ^{[1
,3
]}

机构：

[1] Tel Aviv Univ, Sch Psychol Sci, Tel Aviv, Israel

[2] Univ Cambridge, MRC Cognit & Brain Sci Unit, Cambridge, England

[3] Columbia Univ, Irving Med Ctr, Dept Psychiat, New York, NY 10027 USA

来源：

PSYCHOLOGICAL MEDICINE | 2024年 / 54卷 / 04期

基金：

以色列科学基金会;

关键词：

anhedonia; attention allocation; depression symptoms; negative reinforcement; positive reinforcement; reward learning; selection history; SOCIAL ANXIETY DISORDER; VALUE-DRIVEN ATTENTION; EYE-TRACKING; BIAS MODIFICATION; INDIVIDUALS; ANHEDONIA; INFORMATION; RESPONSES; STIMULI; STRESS;

D O I：

10.1017/S0033291723002519

中图分类号：

B849 [应用心理学];

学科分类号：

040203 ;

摘要：

Background Aberrant reward functioning is implicated in depression. While attention precedes behavior and guides higher-order cognitive processes, reward learning from an attentional perspective - the effects of prior reward-learning on subsequent attention allocation - has been mainly overlooked.Methods The present study explored the effects of reward-based attentional learning in depression using two separate, yet complimentary, studies. In study 1, participants with high (HD) and low (LD) levels of depression symptoms were trained to divert their gaze toward one type of stimuli over another using a novel gaze-contingent music reward paradigm - music played when fixating the desired stimulus type and stopped when gazing the alternate one. Attention allocation was assessed before, during, and following training. In study 2, using negative reinforcement, the same attention allocation pattern was trained while substituting the appetitive music reward for gazing the desired stimulus type with the removal of an aversive sound (i.e. white noise).Results In study 1 both groups showed the intended shift in attention allocation during training (online reward learning), while generalization of learning at post-training was only evident among LD participants. Conversely, in study 2 both groups showed post-training generalization. Results were maintained when introducing anxiety as a covariate, and when using a more powerful sensitivity analysis. Finally, HD participants showed higher learning speed than LD participants during initial online learning, but only when using negative, not positive, reinforcement.Conclusions Deficient generalization of learning characterizes the attentional system of HD individuals, but only when using reward-based positive reinforcement, not negative reinforcement.

引用

页码：794 / 807

页数：14

共 50 条

[31] Compatible Reward Inverse Reinforcement Learning
Metelli, Alberto Maria
Pirotta, Matteo
Restelli, Marcello
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[32] Hierarchical average reward reinforcement learning
Department of Computing Science, University of Alberta, Edmonton, Alta. T6G 2E8, Canada
不详
Journal of Machine Learning Research, 2007, 8 : 2629 - 2669
[33] Distributional Reward Decomposition for Reinforcement Learning
Lin, Zichuan
Zhao, Li
Yang, Derek
Qin, Tao
Yang, Guangwen
Liu, Tie-Yan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[34] Reward learning: Reinforcement, incentives, and expectations
Berridge, KC
PSYCHOLOGY OF LEARNING AND MOTIVATION: ADVANCES IN RESEARCH AND THEORY, VOL 40, 2001, 40 : 223 - 278
[35] Learning to attend and ignore: The influence of reward learning on attentional capture and suppression
Pearson, Daniel
Whitford, Thomas
Le Pelley, Mike
PERCEPTION, 2016, 45 : 356 - 356
[36] Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Gu, Shangding
Sel, Bilgehan
Ding, Yuhao
Wang, Lu
Lin, Qingwei
Jin, Ming
Knoll, Alois
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21099 - 21106
[37] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
Everitt, Tom
Hutter, Marcus
Kumar, Ramana
Krakovna, Victoria
SYNTHESE, 2021, 198 (SUPPL 27) : 6435 - 6467
[38] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
Tom Everitt
Marcus Hutter
Ramana Kumar
Victoria Krakovna
Synthese, 2021, 198 : 6435 - 6467
[39] Learning reward machines: A study in partially observable reinforcement learning
Icarte, Rodrigo Toro
Klassen, Toryn Q.
Valenzano, Richard
Castro, Margarita P.
Waldie, Ethan
Mcilraith, Sheila A.
ARTIFICIAL INTELLIGENCE, 2023, 323
[40] Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation
Gao, Yang
Meyer, Christian M.
Mesgar, Mohsen
Gurevych, Iryna
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2350 - 2356

← 1 2 3 4 5 →