共 50 条
- [1] Blackwell Online Learning for Markov Decision Processes 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
- [2] Online Learning of Safety function for Markov Decision Processes 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
- [3] Online Learning in Markov Decision Processes with Continuous Actions ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316
- [5] Kernelized Q-Learning for Large-Scale, Potentially Continuous, Markov Decision Processes 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 153 - 162
- [6] Online Learning in Markov Decision Processes with Changing Cost Sequences INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
- [7] Online Learning with Implicit Exploration in Episodic Markov Decision Processes 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1953 - 1958
- [8] Online Learning in Markov Decision Processes with Arbitrarily Changing Rewards and Transitions 2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009, : 314 - 322
- [10] A Structure-aware Online Learning Algorithm for Markov Decision Processes PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 71 - 78