共 18 条
- [11] Rangwala M, Williams R., Learning multi-agent communication through structured attentive reasoning, Proceedings of the 34th International Conference on Neural Information Processing Systems, pp. 10088-10098, (2020)
- [12] Ding Z L, Huang T J, Lu Z Q., Learning individually inferred communication for multi-agent cooperation[J/OL], (2020)
- [13] Liu I J, Jain U, Yeh R A, Et al., Cooperative exploration for multi-agent deep reinforcement learning[J/OL], (2021)
- [14] Pesce E, Montana G., Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication, Machine Learning, 109, 9, pp. 1727-1747, (2020)
- [15] Kuba J G, Chen R Q, Wen M N, Et al., Trust region policy optimisation in multi-agent reinforcement learning[J/OL], (2021)
- [16] Hu J, Hu S Y, Liao S W., Policy regularization via noisy advantage values for cooperative multi-agent actor-critic methods[J/OL], (2021)
- [17] Iqbal S, Sha F., Actor-attention-critic for multi-agent reinforcement learning, (2018)
- [18] Kuba J G, Chen R Q, Wen M N, Et al., Trust region policy optimisation in multi-agent reinforcement learning[J/OL], (2021)