共 50 条
- [31] Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [32] From minimax value to low-regret algorithms for online Markov decision processes 2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 471 - 476
- [35] Improved Regret for Differentially Private Exploration in Linear MDP INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [36] Adaptive strategies and regret minimization in arbitrarily varying Markov environments COMPUTATIONAL LEARNING THEORY, PROCEEDINGS, 2001, 2111 : 128 - 142
- [39] Variance minimization for continuous-time Markov decision processes: two approaches Applied Mathematics-A Journal of Chinese Universities, 2010, 25 : 400 - 410
- [40] First Passage Risk Probability Minimization for Piecewise Deterministic Markov Decision Processes Acta Mathematicae Applicatae Sinica, English Series, 2022, 38 : 549 - 567