An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor

被引:1
|
作者
Xu, Siying [1 ,2 ]
Zhang, Gaoyu [2 ]
Yuan, Xianzhi [3 ]
机构
[1] Shanghai Univ Finance & Econ, Shanghai 200433, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Shanghai 201209, Peoples R China
[3] Chengdu Univ, Chengdu 610106, Peoples R China
基金
中国国家自然科学基金;
关键词
SMEs; Multi-agent; Q-learning; Evolutionary gaming; PRODUCT INNOVATION; EVOLUTIONARY GAME; PROTOCOL;
D O I
10.1007/s10614-023-10524-x
中图分类号
F [经济];
学科分类号
02 ;
摘要
In recent years, the study of enterprise survival development and cooperation in the whole economic market has been rapidly developed. However, in most literature studies, the traditional enterprise multi-agent cannot effectively simulate the process of enterprise survival and development since the fundamental characteristics used to describe enterprises in social networks, such as the values of enterprise multi-agent attributes, cannot be changed in process of the simulation. To address this problem, an enterprise multi-agent model based on game Q- learning to simulate enterprise decision making which aims to maximize the benefits of enterprises and optimize the effect of inter-firm cooperation is proposed in this article. The Firm Q Learning algorithm is used to dynamically change the attribute values of the enterprise multi-agent to optimize the game results in the evolutionary game model and thus effectively simulate the dynamic cooperation among the enterprise agents. The simulation result shows that the evolution of the enterprise multi-agent model based on game Q-learning can more realistically reflect the process of real enterprise survival and development than the multi-agent simulation with fixed attribute values.
引用
收藏
页码:2523 / 2562
页数:40
相关论文
共 50 条
  • [41] Extending Q-Learning to general adaptive multi-agent systems
    Tesauro, G
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 871 - 878
  • [42] A theoretical analysis of cooperative behaviorin multi-agent Q-learning
    Waltman, Ludo
    Kaymak, Uzay
    2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 84 - +
  • [43] Multi-Agent Q-Learning for Power Allocation in Interference Channel
    Wongphatcharatham, Tanutsorn
    Phakphisut, Watid
    Wijitpornchai, Thongchai
    Areeprayoonkij, Poonlarp
    Jaruvitayakovit, Tanun
    Hannanta-Anan, Pimkhuan
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 876 - 879
  • [44] Continuous strategy replicator dynamics for multi-agent Q-learning
    Aram Galstyan
    Autonomous Agents and Multi-Agent Systems, 2013, 26 : 37 - 53
  • [45] Minimax fuzzy Q-learning in cooperative multi-agent systems
    Kilic, A
    Arslan, A
    ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 264 - 272
  • [46] DVF:Multi-agent Q-learning with difference value factorization
    Huang, Anqi
    Wang, Yongli
    Sang, Jianghui
    Wang, Xiaoli
    Wang, Yupeng
    KNOWLEDGE-BASED SYSTEMS, 2024, 286
  • [47] Modular Production Control with Multi-Agent Deep Q-Learning
    Gankin, Dennis
    Mayer, Sebastian
    Zinn, Jonas
    Vogel-Heuser, Birgit
    Endisch, Christian
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
  • [48] Multi-Agent Reward-Iteration Fuzzy Q-Learning
    Lixiong Leng
    Jingchen Li
    Jinhui Zhu
    Kao-Shing Hwang
    Haobin Shi
    International Journal of Fuzzy Systems, 2021, 23 : 1669 - 1679
  • [49] Cooperative Multi-Agent Q-Learning Using Distributed MPC
    Esfahani, Hossein Nejatbakhsh
    Velni, Javad Mohammadpour
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2193 - 2198
  • [50] A distributed Q-learning algorithm for multi-agent team coordination
    Huang, J
    Yang, B
    Liu, DY
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 108 - 113