Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning

被引:1
|
作者
Cai, Jia [1 ]
Huang, Changqiang [1 ]
Guo, Haifeng [1 ]
机构
[1] AF Engn Univ, Aeronaut & Astronaut Engn Inst, Xian, Peoples R China
关键词
Reinforcement learning; Nash Q-learning; Target tracking; Extended Kalman filtering; Multi-sensor cooperation; Distribution;
D O I
10.4028/www.scientific.net/AMR.591-593.1475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional target tracking algorithm has a disadvantage of excessive dependence on the environment model. Thus a multi-sensor cooperative tracking method using distributed Nash Q-learning was proposed. Distributed Nash Q-learning with model-free was firstly described. Then sensor action and reward function were defined, which both are very crucial to the learning. Sensor action was only subjected to angle control, and reward function was given by calculating the trace of one time-step prediction error covariance. Nash tragedy can not be directly calculated, therefore, a probability statistics method using Bayesian inference was used to update the Q function. Simulation of passive tracking merely with angle measurements shows that this algorithm can enhance the adaptation to environment change and the tracking accuracy.
引用
收藏
页码:1475 / 1478
页数:4
相关论文
共 50 条
  • [1] Cooperative Multi-Agent Q-Learning Using Distributed MPC
    Esfahani, Hossein Nejatbakhsh
    Velni, Javad Mohammadpour
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2193 - 2198
  • [2] Multi-sensor tracking by cooperative processors
    Mallaina, EF
    Frías, BC
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING IX, 2004, 5238 : 504 - 511
  • [3] Enhancing Nash Q-learning and Team Q-learning mechanisms by using bottlenecks
    Ghazanfari, Behzad
    Mozayani, Nasser
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (06) : 2771 - 2783
  • [4] Dynamic obstacle avoidance based on multi-sensor fusion and Q-learning algorithm
    Zhang, Yi
    Wei, Xin
    Zhou, Xiangyu
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1569 - 1573
  • [5] Cooperative Spectrum Sensing for Cognitive Radios using Distributed Q-Learning
    van den Biggelaar, Olivier
    Dricot, Jean-Michel
    De Doncker, Philippe
    Horlin, Francois
    2011 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2011,
  • [6] Multi Target Tracking using a Compact Q-Learning with a Teacher
    Saad, E. M.
    Awadalla, M. H.
    Hamdy, A. M.
    Ali, H. I.
    ICCES: 2008 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS, 2007, : 173 - 178
  • [7] Distributed lazy Q-learning for cooperative mobile robots
    Touzet, Claude F.
    International Journal of Advanced Robotic Systems, 2004, 1 (01) : 5 - 13
  • [8] Informative Path Planning for Multi-UUV Cooperative Search with Distributed Q-Learning
    Han, Zhengqing
    Song, Guanglei
    Sun, Qi
    Jiao, Huifeng
    Wang, Yintao
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT VI, 2025, 15206 : 80 - 93
  • [9] Multi-Target Tracking using a Compact Q-Learning with a Teacher
    Saad, E. M.
    Awadalla, M. H.
    Hamdy, A. M.
    Ali, H. I.
    NRSC: 2009 NATIONAL RADIO SCIENCE CONFERENCE: NRSC 2009, VOLS 1 AND 2, 2009, : 284 - 295
  • [10] Distributed multi-sensor multi-target tracking with feedback
    Khawsuk, W
    Pao, LY
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 5356 - 5362