50 entries in total
- [42] First Passage Risk Probability Minimization for Piecewise Deterministic Markov Decision Processes. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2022, 38(03): 549-567
- [44] Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024: 10980-10988
- [45] √n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank. CONFERENCE ON LEARNING THEORY, VOL 125, 2020, 125
- [46] Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018: 4867-4873
- [47] A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action. ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020: 51-55