Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning

Authors
Cai, Jia [1]
Huang, Changqiang [1]
Guo, Haifeng [1]
Affiliations
[1] AF Engn Univ, Aeronaut & Astronaut Engn Inst, Xian, Peoples R China
Keywords
Reinforcement learning; Nash Q-learning; Target tracking; Extended Kalman filtering; Multi-sensor cooperation; Distribution
DOI
10.4028/www.scientific.net/AMR.591-593.1475
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Traditional target tracking algorithms depend excessively on the environment model. To address this, a multi-sensor cooperative tracking method using distributed Nash Q-learning is proposed. Model-free distributed Nash Q-learning is described first. The sensor action and the reward function, both crucial to learning, are then defined: the sensor action is restricted to angle control, and the reward is computed from the trace of the one-step prediction error covariance. Because the Nash strategy cannot be calculated directly, a probabilistic method based on Bayesian inference is used to update the Q function. Simulations of passive tracking with angle-only measurements show that the algorithm improves both adaptation to environmental change and tracking accuracy.
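The reward described in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes the standard Kalman covariance prediction P_pred = F P F^T + Q and takes the reward as the negative trace, so that lower predicted uncertainty scores higher. The sign convention, the function names, and the example matrices are all assumptions made for illustration; the abstract states only that the reward is calculated from the trace.

```python
def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def predicted_covariance(F, P, Q):
    """One-step covariance prediction: F P F^T + Q."""
    fpft = mat_mul(mat_mul(F, P), transpose(F))
    n = len(Q)
    return [[fpft[i][j] + Q[i][j] for j in range(n)] for i in range(n)]


def trace_reward(F, P, Q):
    """Hypothetical reward: negative trace of the predicted covariance.

    The negative sign is an assumption, chosen so that a smaller
    predicted error covariance gives a larger reward.
    """
    p_pred = predicted_covariance(F, P, Q)
    return -sum(p_pred[i][i] for i in range(len(p_pred)))


# Illustrative values only: constant-velocity model in one dimension.
F = [[1.0, 1.0], [0.0, 1.0]]   # state transition (position, velocity)
P = [[1.0, 0.0], [0.0, 1.0]]   # covariance after the measurement update
Q = [[0.1, 0.0], [0.0, 0.1]]   # process noise covariance
r = trace_reward(F, P, Q)
```

In a tracking loop of this kind, the sensor's angle action would enter through P, the covariance after the measurement update for that action, so different actions would yield different rewards.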
Pages: 1475-1478
Number of pages: 4