50 entries in total
- [1] Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [2] Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 10980 - 10988
- [4] Average-Reward Decentralized Markov Decision Processes 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1997 - 2002
- [5] Robust Average-Reward Markov Decision Processes THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15215 - 15223
- [7] A Duality Approach for Regret Minimization in Average-Reward Ergodic Markov Decision Processes LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, : 862 - 883
- [8] A Unified Approach for Semi-Markov Decision Processes with Discounted and Average Reward Criteria 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1741 - 1744