Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning

被引：1

作者：

Cai, Jia ^{[1
]}

Huang, Changqiang ^{[1
]}

Guo, Haifeng ^{[1
]}

机构：

[1] AF Engn Univ, Aeronaut & Astronaut Engn Inst, Xian, Peoples R China

来源：

MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3 | 2012年 / 591-593卷

关键词：

Reinforcement learning; Nash Q-learning; Target tracking; Extended Kalman filtering; Multi-sensor cooperation; Distribution;

D O I：

10.4028/www.scientific.net/AMR.591-593.1475

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Traditional target tracking algorithm has a disadvantage of excessive dependence on the environment model. Thus a multi-sensor cooperative tracking method using distributed Nash Q-learning was proposed. Distributed Nash Q-learning with model-free was firstly described. Then sensor action and reward function were defined, which both are very crucial to the learning. Sensor action was only subjected to angle control, and reward function was given by calculating the trace of one time-step prediction error covariance. Nash tragedy can not be directly calculated, therefore, a probability statistics method using Bayesian inference was used to update the Q function. Simulation of passive tracking merely with angle measurements shows that this algorithm can enhance the adaptation to environment change and the tracking accuracy.

引用

页码：1475 / 1478

页数：4

共 50 条

[41] EFFECTS OF COMMUNICATION IN COOPERATIVE Q-LEARNING
Darbyshire, Paul
Wang, Dianhui
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (05): : 2113 - 2126
[42] Expertness based cooperative Q-learning
Ahmadabadi, MN
Asadpour, M
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01): : 66 - 76
[43] DISTRIBUTED CORRELATED Q-LEARNING FOR DYNAMIC TRANSMISSION CONTROL OF SENSOR NETWORKS
Huang, Jane Wei
Zhu, Quanyan
Krishnamurthy, Vikram
Basar, Tamer
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1982 - 1985
[44] Cooperative behavior acquisition for multi-agent systems by Q-learning
Xie, M. C.
Tachibana, A.
2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 424 - +
[45] A theoretical analysis of cooperative behaviorin multi-agent Q-learning
Waltman, Ludo
Kaymak, Uzay
2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 84 - +
[46] Cooperative Q-learning techniques for distributed online power allocation in femtocell networks
Saad, Hussein
Mohamed, Amr
ElBatt, Tamer
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2015, 15 (15): : 1929 - 1944
[47] Minimax fuzzy Q-learning in cooperative multi-agent systems
Kilic, A
Arslan, A
ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 264 - 272
[48] Hierarchical Q-Learning Path Planning for Cooperative Tracking Control of Multi-Agent Systems With Lumped Uncertainties
Lu, Mai-Kao
Ge, Ming-Feng
Liu, Zhi-Wei
Ding, Teng-Fei
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 11
[49] Cooperative Spectrum Sensing Using Q-Learning with Experimental Validation
Chen, Zhe
Qiu, Robert C.
IEEE SOUTHEASTCON 2011: BUILDING GLOBAL ENGINEERS, 2011, : 405 - 408
[50] Distributed multi-sensor particle filter for bearings-only tracking
Zhang, Jungen
Ji, Hongbing
INTERNATIONAL JOURNAL OF ELECTRONICS, 2012, 99 (02) : 239 - 254

← 1 2 3 4 5 →