A survey of preference-based reinforcement learning methods

被引:0
|
作者
机构
[1] Wirth, Christian
[2] Akrour, Riad
[3] Neumann, Gerhard
[4] Fürnkranz, Johannes
来源
| 1600年 / Microtome Publishing卷 / 18期
关键词
105;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [21] Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards
    Metcalf, Katherine
    Sarabia, Miguel
    Mackraz, Natalie
    Theobald, Barry-John
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [22] VARIQuery: VAE Segment-based Active Learning for Query Selection in Preference-based Reinforcement Learning
    Marta, Daniel
    Holk, Simon
    Pek, Christian
    Tumova, Jana
    Leite, Iolanda
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7878 - 7885
  • [23] Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
    Liu, Runze
    Bai, Fengshuo
    Du, Yali
    Yang, Yaodong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [24] A Leading Cruise Controller for Autonomous Vehicles in Mixed Autonomy Based on Preference-Based Reinforcement Learning
    Wen, Xiao
    Jian, Sisi
    He, Dengbo
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 752 - 757
  • [25] Task Decoupling in Preference-based Reinforcement Learning for Personalized Human-Robot Interaction
    Liu, Mingjiang
    Chen, Chunlin
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 848 - 855
  • [26] Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated
    Metcalf, Katherine
    Sarabia, Miguel
    Fedzechkina, Masha
    Theobald, Barry-John
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9, 2024, : 10128 - 10136
  • [27] Mixing corrupted preferences for robust and feedback-efficient preference-based reinforcement learning
    Heo, Jongkook
    Lee, Young Jae
    Kim, Jaehoon
    Kwak, Min Gu
    Park, Young Joon
    Kim, Seoung Bum
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [28] A Preference-Based Online Reinforcement Learning With Embedded Communication Failure Solutions in Smart Grid
    Dang, Yibing
    Xu, Jiangjiao
    Li, Dongdong
    Sun, Hongjian
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (03) : 2422 - 2431
  • [29] Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
    Chen, Xiaoyu
    Zhong, Han
    Yang, Zhuoran
    Wang, Zhaoran
    Wang, Liwei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [30] Task Transfer by Preference-Based Cost Learning
    Jing, Mingxuan
    Ma, Xiaojian
    Huang, Wenbing
    Sun, Fuchun
    Liu, Huaping
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2471 - 2478