Enhancing Noisy Binary Search Efficiency through Deep Reinforcement Learning

被引：0

作者：

Ma, Rui ^{[1
]}

Tao, Yudong ^{[1
]}

Khodeiry, Mohamed M. ^{[2
]}

Alawa, Karam A. ^{[2
]}

Shyu, Mei-Ling ^{[3
]}

Lee, Richard K. ^{[1
,2
]}

机构：

[1] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL USA

[2] Univ Miami, Bascom Palmer Eye Inst, Miller Sch Med, Miami, FL USA

[3] Univ Missouri Kansas City, Sch Sci & Engn, Kansas City, MO USA

来源：

2023 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI | 2023年

关键词：

Deep learning; deep reinforcement learning; noisy binary search;

D O I：

10.1109/IRI58017.2023.00033

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Noisy binary search (NBS) aims to find the closest element to a target value within a sorted array through erroneous queries. In an ideal NBS environment where the error rate remains constant, and the costs of all queries are the same, the maximum likelihood estimation (MLE) procedure has been proven to be the optimal decision strategy. However, in some non-ideal NBS problems, both the error rates and the costs are dependent on the queries, and in some cases, finding the optimal decision strategies can be intractable. We propose to use deep reinforcement learning to approximate the optimal decision strategy in the NBS problem, in which an intelligent agent is used to interact with the NBS environment. A dueling double deep Q-network guides the agent to take action at each step, either to generate a query or to stop the search and predict the target value. An optimized policy will be derived by training the network in the NBS environment until convergence. By evaluating our proposed algorithm on a non-ideal NBS environment, visual field test, we show that the performance of our proposed algorithm surpasses baseline visual field testing algorithms by a large margin.

引用

页码：154 / 159

页数：6

共 50 条

[21] Enhancing cut selection through reinforcement learning
Shengchao Wang
Liang Chen
Lingfeng Niu
Yu-Hong Dai
Science China(Mathematics), 2024, 67 (06) : 1377 - 1394
[22] Survivable Robotic Control through Guided Bayesian Policy Search with Deep Reinforcement Learning
Raza, Sayyed Jaffar Ali
Dastider, Apan
Lin, Mingjie
2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 1188 - 1193
[23] Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning
Stork, Joerg
Zaefferer, Martin
Bartz-Beielstein, Thomas
Eiben, A. E.
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 934 - 942
[24] Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps
Wang, Zhitao
Qiu, Yubin
Zhou, Shiyu
Tian, Yanfa
Zhu, Xiangyuan
Liu, Jiying
Lu, Shengze
ENERGIES, 2025, 18 (01)
[25] Improving exploration efficiency of deep reinforcement learning through samples produced by generative model
Xu, Dayong
Zhu, Fei
Liu, Quan
Zhao, Peiyao
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185
[26] Collaborative Deep Reinforcement Learning for Joint Object Search
Kong, Xiangyu
Xin, Bo
Wang, Yizhou
Hua, Gang
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7072 - 7081
[27] Learning Mobile Manipulation through Deep Reinforcement Learning
Wang, Cong
Zhang, Qifeng
Tian, Qiyan
Li, Shuo
Wang, Xiaohui
Lane, David
Petillot, Yvan
Wang, Sen
SENSORS, 2020, 20 (03)
[28] Multitask Augmented Random Search in deep reinforcement learning
Thanh, Le Tien
Thang, Ta Bao
Van Cuong, Le
Binh, Huynh Thi Thanh
APPLIED SOFT COMPUTING, 2024, 160
[29] Enhancing Network by Reinforcement Learning and Neural Confined Local Search
Hu, Qifu
Li, Ruyang
Deng, Qi
Zhao, Yaqian
Li, Rengang
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2122 - 2132
[30] Enhancing arrhythmia prediction through an adaptive deep reinforcement learning framework for ECG signal analysis
Serhani, Mohamed Adel
Ismail, Heba
El-Kassabi, Hadeel T.
Al Breiki, Hamda
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101

← 1 2 3 4 5 →