共 50 条
- [31] Oblivious Markov Decision Processes: Planning and Policy Execution 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3850 - 3857
- [33] Planning with Hierarchical Temporal Memory for Deterministic Markov Decision Problem ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1073 - 1081
- [34] Learning and Planning with Timing Information in Markov Decision Processes UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 111 - 120
- [36] Lagrange Dual Decomposition for Finite Horizon Markov Decision Processes MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2011, 6911 : 487 - 502
- [37] Large Scale Markov Decision Processes with Changing Rewards ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [38] Online Regret Bounds for Markov Decision Processes with Deterministic Transitions ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 123 - 137
- [40] Online Learning in Markov Decision Processes with Changing Cost Sequences INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32