Optimizing Quantiles in Preference-Based Markov Decision Processes

被引:0
|
作者
Gilbert, Hugo [1 ]
Weng, Paul [2 ,3 ,4 ]
Xu, Yan [2 ,3 ,4 ]
机构
[1] UPMC Univ Paris 06, Sorbonne Univ, CNRS, LIP6,UMR 7606, Paris, France
[2] SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China
[3] Sch Elect & Informat Technol, Guangzhou, Guangdong, Peoples R China
[4] SYSU CMU Shunde Int Joint Res Inst, Shunde, Peoples R China
关键词
MINIMIZING RISK MODELS; VARIANCE; UTILITY; POLICY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Markov decision process model, policies are usually evaluated by expected cumulative rewards. As this decision criterion is not always suitable, we propose in this paper an algorithm for computing a policy optimal for the quantile criterion. Both finite and infinite horizons are considered. Finally we experimentally evaluate our approach on random MDPs and on a data center control problem.
引用
收藏
页码:3569 / 3575
页数:7
相关论文
共 50 条
  • [21] A hybrid fuzzy-neuro model for preference-based decision analysis
    Lee, VCS
    Sim, ATH
    INTELLIGENT DAA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 449 - 456
  • [22] Preference-Based Stream Analysis for Efficient Decision-Support Systems
    Rudenko, Lena
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 767 : 397 - 409
  • [23] Interaction and risk preference-based group decision-making with intuitionistic fuzzy preference relations
    Wei, Ying
    Gong, Kaixin
    Chen, Chunfang
    Zhu, Xianghong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 6185 - 6199
  • [24] Optimizing exploration strategies through decomposed and factored Markov Decision Processes
    Teichteil-Königsbuch, Florent
    Fabiani, Patrick
    Revue d'Intelligence Artificielle, 2006, 20 (2-3) : 133 - 179
  • [25] Preference-based multi-objectivization applied to decision support for High-Pressure Thermal processes in food treatment
    Fernandez, Miriam R.
    Redondo, Juana L.
    Ivorra, Benjamin
    Ramos, Angel M.
    Ortigosa, Pilar M.
    APPLIED SOFT COMPUTING, 2019, 79 : 326 - 340
  • [26] Preference-based belief operators
    Asheim, GB
    Sovik, Y
    MATHEMATICAL SOCIAL SCIENCES, 2005, 50 (01) : 61 - 82
  • [27] Applying Preference-based Customization
    Liaskos, Sotirios
    Rogozhkin, Vyacheslav
    2011 19TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE), 2011, : 353 - +
  • [28] Preference-based group scheduling
    Hu, J
    Brzozowski, M
    HUMAN-COMPUTER INTERACTION - INTERACT 2005, PROCEEDINGS, 2005, 3585 : 990 - 993
  • [29] Preference-Based Trajectory Generation
    Lennon, Jamie A.
    Atkins, Ella M.
    JOURNAL OF AEROSPACE COMPUTING INFORMATION AND COMMUNICATION, 2009, 6 (03): : 142 - 170
  • [30] Preference-based search for scheduling
    Junker, U
    SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 904 - 909