50 records in total
- [31] Safe Policy Improvement for POMDPs via Finite-State Controllers. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023: 15109-15117
- [32] Decentralized Learning of Finite-Memory Policies in Dec-POMDPs. IFAC PAPERSONLINE, 2023, 56 (02): 2601-2607
- [33] Compositional Construction of Safety Controllers for Networks of Continuous-Space POMDPs. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): 87-99
- [34] Reinforcement learning for POMDPs based on action values and stochastic optimization. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002: 199-204
- [35] A Role-based POMDPs Approach for Decentralized Implicit Cooperation of Multiple Agents. 2017 13TH IEEE INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2017: 496-501
- [40] FIXED-SIZE RECTANGULAR CONFIDENCE REGIONS. COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1977, 6 (03): 251-264