Rating-Based Reinforcement Learning

被引:0
|
作者
White, Devin [1 ]
Wu, Mingkang [1 ]
Novoseller, Ellen [2 ]
Lawhern, Vernon J. [2 ]
Waytowich, Nicholas [2 ]
Cao, Yongcan [1 ]
机构
[1] Univ Texas San Antonio, San Antonio, TX 78249 USA
[2] DEVCOM Army Res Lab, Adelphi, MD USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper develops a novel rating-based reinforcement learning (RbRL) approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individual trajectories without relative comparisons between sample pairs. The rating-based reinforcement learning approach builds on a new prediction model for human ratings and a novel multiclass loss function. We finally conduct several experimental studies based on synthetic ratings and real human ratings to evaluate the performance of the new rating-based reinforcement learning approach.
引用
收藏
页码:10207 / 10215
页数:9
相关论文
共 50 条
  • [1] Rating-based CDS curves
    Kolokolova, Olga
    Lin, Ming-Tsung
    Poon, Ser-Huang
    EUROPEAN JOURNAL OF FINANCE, 2019, 25 (07): : 689 - 723
  • [2] Do rating agencies cater? Evidence from rating-based contracts
    Kraft, Pepa
    JOURNAL OF ACCOUNTING & ECONOMICS, 2015, 59 (2-3): : 264 - 283
  • [3] A Random Forest Approach for Rating-based Recommender System
    Ajesh, A.
    Nair, Jayashree
    Jijin, P. S.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1293 - 1297
  • [4] Measure Rating Transition of Retail Credit Portfolio of Internal Rating-based System
    Long, Quan
    Ding, Yong-Sheng
    EIGHTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III, 2009, : 1064 - 1068
  • [5] Handling of individual differences in rating-based conjoint analysis
    Endrizzi, Isabella
    Menichelli, Elena
    Johansen, Susanne Bolling
    Olsen, Nina Veflen
    Naes, Tormod
    FOOD QUALITY AND PREFERENCE, 2011, 22 (03) : 241 - 254
  • [6] A new approach for rating-based collaborative music recommendation
    Tzeng, YS
    Chen, HC
    Chen, ALP
    Proceedings of the Eighth IASTED International Conference on Internet and Multimedia Systems and Applications, 2004, : 385 - 391
  • [7] Rating-Based Investment Practices and Bond Market Segmentation
    Chen, Zhihua
    Lookman, Aziz A.
    Schurhoff, Norman
    Seppi, Duane J.
    REVIEW OF ASSET PRICING STUDIES, 2014, 4 (02): : 162 - 205
  • [8] Rating-based Collaborative Filtering Combined with Additional Regularization
    Wu, Shu
    Wang, Shengrui
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1195 - 1196
  • [9] Rating-based Preference Elicitation for Recommendation of Stress Intervention
    Torkamaan, Helma
    Ziegler, Juergen
    ACM UMAP '19: PROCEEDINGS OF THE 27TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, 2019, : 46 - 50
  • [10] The Bond-Pricing Implications of Rating-Based Capital Requirements
    Murray, Scott
    Nikolova, Stanislava
    JOURNAL OF FINANCIAL AND QUANTITATIVE ANALYSIS, 2022, 57 (06) : 2177 - 2207