Rating-Based Reinforcement Learning

被引:0
|
作者
White, Devin [1 ]
Wu, Mingkang [1 ]
Novoseller, Ellen [2 ]
Lawhern, Vernon J. [2 ]
Waytowich, Nicholas [2 ]
Cao, Yongcan [1 ]
机构
[1] Univ Texas San Antonio, San Antonio, TX 78249 USA
[2] DEVCOM Army Res Lab, Adelphi, MD USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper develops a novel rating-based reinforcement learning (RbRL) approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individual trajectories without relative comparisons between sample pairs. The rating-based reinforcement learning approach builds on a new prediction model for human ratings and a novel multiclass loss function. We finally conduct several experimental studies based on synthetic ratings and real human ratings to evaluate the performance of the new rating-based reinforcement learning approach.
引用
收藏
页码:10207 / 10215
页数:9
相关论文
共 50 条
  • [21] Topic-oriented community detection of rating-based social networks
    Reihanian, Ali
    Minaei-Bidgoli, Behrouz
    Alizadeh, Hosein
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2016, 28 (03) : 303 - 310
  • [22] Market share predictions A new model with rating-based conjoint analysis
    Guyon, Herve
    Petiot, Jean-Francois
    INTERNATIONAL JOURNAL OF MARKET RESEARCH, 2011, 53 (06) : 831 - 857
  • [23] Evaluating the effect of topic consideration in identifying communities of rating-based social networks
    Reihanian, Ali
    Minaei-Bidgoli, Behrouz
    Yousefnezhad, Muhammad
    2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
  • [24] Building Feedback Rating-based Reputation System for Trusted Delivery of Cloud Services
    Yan, Chao
    Qi, Lianyong
    Ni, Jiancheng
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2014, 5 : 684 - 687
  • [25] Do Firms Manage Their Credit Ratings? Evidence from Rating-Based Contracts
    Zhang, Xia
    ACCOUNTING HORIZONS, 2018, 32 (04) : 163 - 183
  • [26] Rating-Based Recommender System Based on Textual Reviews Using IoT Smart Devices
    Ahmed, Muqeem
    Ansari, Mohd Dilshad
    Singh, Ninni
    Gunjan, Vinit Kumar
    Krishna, Santhosh B. V.
    Khan, Mudassir
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [27] A cross-validity comparison of rating-based and choice-based conjoint analysis models
    Moore, WL
    INTERNATIONAL JOURNAL OF RESEARCH IN MARKETING, 2004, 21 (03) : 299 - 312
  • [28] Mitigating Bias in GLAM Search Engines: A Simple Rating-Based Approach and Reflection
    Tian, Xinran
    Nunes, Bernardo Pereira
    Grant, Katrina
    Casanova, Marco Antonio
    34TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA, HT 2023, 2023,
  • [29] A Discussion on Nonlinear Models for Price Decisions in Rating-Based Product Preference Models
    Corain, Livio
    Melas, Viatcheslav B.
    Salmaso, Luigi
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2009, 38 (06) : 1178 - 1201
  • [30] Adaptive memory: Temporal, semantic, and rating-based clustering following survival processing
    Nairne, James S.
    Cogdill, Mindi
    Lehman, Melissa
    JOURNAL OF MEMORY AND LANGUAGE, 2017, 93 : 304 - 314