50 items in total
- [1] Local Analysis of Entropy-Regularized Stochastic Soft-Max Policy Gradient Methods. 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023
- [2] Cooperative Multi-agent Policy Gradient. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I, 2019, 11051 : 459 - 476
- [3] Evolutionary Dynamics of Multi-agent Formation. CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009 : 3557 - 3561
- [4] MAPPG: Multi-agent Phasic Policy Gradient. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023 : 2366 - 2371
- [5] Evolutionary Dynamics of Multi-Agent Learning: A Survey. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2015, 53 : 659 - 697
- [6] TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024 : 17496 - 17504
- [7] Multi-category classification by soft-max combination of binary classifiers. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDING, 2003, 2709 : 125 - 134
- [8] Blameworthiness in Multi-Agent Settings. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019 : 525 - 532
- [9] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient. PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021 : 48 - 52
- [10] A learning automata approach to multi-agent policy gradient learning. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2008, 5178 : 379 - +