共 50 条
- [1] Online Learning in Markov Decision Processes with Changing Cost Sequences INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
- [2] Large Scale Markov Decision Processes with Changing Rewards ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [3] Blackwell Online Learning for Markov Decision Processes 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
- [4] Online Learning in Kernelized Markov Decision Processes 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [5] Arbitrarily Modulated Markov Decision Processes PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 2946 - 2953
- [6] Markov Decision Processes with Functional Rewards MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2013, 8271 : 269 - 280
- [7] Online Regret Bounds for Markov Decision Processes with Deterministic Transitions ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 123 - 137
- [9] Online Learning of Safety function for Markov Decision Processes 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
- [10] Online Learning in Markov Decision Processes with Continuous Actions ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316