An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor

Cited by: 1

Authors:

Xu, Siying [1, 2]

Zhang, Gaoyu [2]

Yuan, Xianzhi [3]

Affiliations:

[1] Shanghai Univ Finance & Econ, Shanghai 200433, Peoples R China

[2] Shanghai Lixin Univ Accounting & Finance, Shanghai 201209, Peoples R China

[3] Chengdu Univ, Chengdu 610106, Peoples R China

Source:

COMPUTATIONAL ECONOMICS | 2024 / Vol. 64 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

SMEs; Multi-agent; Q-learning; Evolutionary gaming; PRODUCT INNOVATION; EVOLUTIONARY GAME; PROTOCOL;

DOI:

10.1007/s10614-023-10524-x

Chinese Library Classification (CLC) number:

F [Economics];

Discipline classification code:

02;

Abstract:

In recent years, research on enterprise survival, development, and cooperation across the economic market has grown rapidly. However, in most of the literature, traditional enterprise multi-agent models cannot effectively simulate the process of enterprise survival and development, because the fundamental characteristics used to describe enterprises in social networks, such as the values of enterprise multi-agent attributes, cannot be changed during the simulation. To address this problem, this article proposes an enterprise multi-agent model based on game Q-learning to simulate enterprise decision making, with the aim of maximizing the benefits of enterprises and optimizing the effect of inter-firm cooperation. The Firm Q Learning algorithm dynamically changes the attribute values of the enterprise multi-agent to optimize the game results in the evolutionary game model and thus effectively simulate dynamic cooperation among the enterprise agents. The simulation results show that the evolution of the enterprise multi-agent model based on game Q-learning reflects the process of real enterprise survival and development more realistically than a multi-agent simulation with fixed attribute values.
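
For readers unfamiliar with the learning rule the abstract builds on, the following is a minimal sketch of standard tabular Q-learning for a single firm choosing between cooperating and defecting. It is not the paper's Firm Q Learning algorithm, whose details this record does not give. The payoff values, the assumption that the partner firm always cooperates, the hyperparameters, and the names `PAYOFF` and `train_firm` are all hypothetical and chosen only for illustration.

```python
import random

ACTIONS = ("cooperate", "defect")

# Hypothetical payoffs to the learning firm, assuming its partner always
# cooperates. These numbers are illustrative and not taken from the paper.
PAYOFF = {"cooperate": 4.0, "defect": 1.0}


def train_firm(steps=5000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Stateless tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {action: 0.0 for action in ACTIONS}
    for _ in range(steps):
        # Explore with probability epsilon, otherwise act greedily.
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=q.get)
        reward = PAYOFF[action]
        # Standard Q-learning target: reward plus discounted best future value.
        target = reward + gamma * max(q.values())
        q[action] += alpha * (target - q[action])
    return q


q_values = train_firm()
best_action = max(ACTIONS, key=q_values.get)
print(best_action, q_values)
```

Under these assumed payoffs, cooperation pays more in every round, so the learned value of cooperating should end up above that of defecting. A model in the spirit of the abstract would additionally let the learned values feed back into the agents' attribute values, which this sketch does not attempt.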


Pages: 2523-2562

Page count: 40

50 items in total

[41] Extending Q-Learning to general adaptive multi-agent systems
Tesauro, G
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 871 - 878
[42] A theoretical analysis of cooperative behavior in multi-agent Q-learning
Waltman, Ludo
Kaymak, Uzay
2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 84 - +
[43] Multi-Agent Q-Learning for Power Allocation in Interference Channel
Wongphatcharatham, Tanutsorn
Phakphisut, Watid
Wijitpornchai, Thongchai
Areeprayoonkij, Poonlarp
Jaruvitayakovit, Tanun
Hannanta-Anan, Pimkhuan
2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 876 - 879
[44] Continuous strategy replicator dynamics for multi-agent Q-learning
Aram Galstyan
Autonomous Agents and Multi-Agent Systems, 2013, 26 : 37 - 53
[45] Minimax fuzzy Q-learning in cooperative multi-agent systems
Kilic, A
Arslan, A
ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 264 - 272
[46] DVF: Multi-agent Q-learning with difference value factorization
Huang, Anqi
Wang, Yongli
Sang, Jianghui
Wang, Xiaoli
Wang, Yupeng
KNOWLEDGE-BASED SYSTEMS, 2024, 286
[47] Modular Production Control with Multi-Agent Deep Q-Learning
Gankin, Dennis
Mayer, Sebastian
Zinn, Jonas
Vogel-Heuser, Birgit
Endisch, Christian
2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
[48] Multi-Agent Reward-Iteration Fuzzy Q-Learning
Lixiong Leng
Jingchen Li
Jinhui Zhu
Kao-Shing Hwang
Haobin Shi
International Journal of Fuzzy Systems, 2021, 23 : 1669 - 1679
[49] Cooperative Multi-Agent Q-Learning Using Distributed MPC
Esfahani, Hossein Nejatbakhsh
Velni, Javad Mohammadpour
IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2193 - 2198
[50] A distributed Q-learning algorithm for multi-agent team coordination
Huang, J
Yang, B
Liu, DY
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 108 - 113
