50 items in total
- [42] Online Learning in Markov Decision Processes with Changing Cost Sequences. International Conference on Machine Learning, Vol 32 (Cycle 1), 2014, 32
- [45] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes. Doklady Mathematics, 2023, 108: S382-S392
- [46] Model-Free Reinforcement Learning for Branching Markov Decision Processes. Computer Aided Verification, Pt II, CAV 2021, 2021, 12760: 651-673
- [47] Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes. Advances in Neural Information Processing Systems 33, NeurIPS 2020, 2020, 33
- [49] Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes. Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, 2020, 34: 9794-9801
- [50] An Inverse Reinforcement Learning Algorithm for semi-Markov Decision Processes. 2017 IEEE Symposium Series on Computational Intelligence (SSCI), 2017: 1256-1261