Utility Maximizing Sequential Sensing Over a Finite Horizon

被引:8
|
作者
Ferrari, Lorenzo [1 ]
Zhao, Qing [2 ]
Scaglione, Anna [1 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[2] Cornell Univ, Sch Elect Engn, Phillips Hall, Ithaca, NY 14853 USA
基金
美国国家科学基金会;
关键词
Optimum sequential testing; opportunistic spectrum access; multi-channel sensing; cognitive radio; ALGORITHMS; NETWORKS; DESIGN; ACCESS; MAC;
D O I
10.1109/TSP.2017.2692725
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We consider the problem of optimally utilizing N resources, each in an unknown binary state. The state of each resource can be inferred from state-dependent noisy measurements. Depending on its state, utilizing a resource results in either a reward or a penalty per unit time. The objective is a sequential strategy governing the decision of sensing and exploitation at each time to maximize the expected utility (i.e., total reward minus total penalty and sensing cost) over a finite horizon L. We formulate the problem as a partially observable Markov decision process and show that the optimal strategy is based on two time-varying thresholds for each resource and an optimal selection rule to sense a particular resource. Since a full characterization of the optimal strategy is generally intractable, we develop a low-complexity policy that is shown by simulations to offer a near optimal performance. This problem finds applications in opportunistic spectrum access, marketing strategies, and other sequential resource allocation problems.
引用
收藏
页码:3430 / 3445
页数:16
相关论文
共 50 条