Enhancing Noisy Binary Search Efficiency through Deep Reinforcement Learning

被引:0
|
作者
Ma, Rui [1 ]
Tao, Yudong [1 ]
Khodeiry, Mohamed M. [2 ]
Alawa, Karam A. [2 ]
Shyu, Mei-Ling [3 ]
Lee, Richard K. [1 ,2 ]
机构
[1] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL USA
[2] Univ Miami, Bascom Palmer Eye Inst, Miller Sch Med, Miami, FL USA
[3] Univ Missouri Kansas City, Sch Sci & Engn, Kansas City, MO USA
关键词
Deep learning; deep reinforcement learning; noisy binary search;
D O I
10.1109/IRI58017.2023.00033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Noisy binary search (NBS) aims to find the closest element to a target value within a sorted array through erroneous queries. In an ideal NBS environment where the error rate remains constant, and the costs of all queries are the same, the maximum likelihood estimation (MLE) procedure has been proven to be the optimal decision strategy. However, in some non-ideal NBS problems, both the error rates and the costs are dependent on the queries, and in some cases, finding the optimal decision strategies can be intractable. We propose to use deep reinforcement learning to approximate the optimal decision strategy in the NBS problem, in which an intelligent agent is used to interact with the NBS environment. A dueling double deep Q-network guides the agent to take action at each step, either to generate a query or to stop the search and predict the target value. An optimized policy will be derived by training the network in the NBS environment until convergence. By evaluating our proposed algorithm on a non-ideal NBS environment, visual field test, we show that the performance of our proposed algorithm surpasses baseline visual field testing algorithms by a large margin.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [21] Enhancing cut selection through reinforcement learning
    Shengchao Wang
    Liang Chen
    Lingfeng Niu
    Yu-Hong Dai
    Science China(Mathematics), 2024, 67 (06) : 1377 - 1394
  • [22] Survivable Robotic Control through Guided Bayesian Policy Search with Deep Reinforcement Learning
    Raza, Sayyed Jaffar Ali
    Dastider, Apan
    Lin, Mingjie
    2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 1188 - 1193
  • [23] Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning
    Stork, Joerg
    Zaefferer, Martin
    Bartz-Beielstein, Thomas
    Eiben, A. E.
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 934 - 942
  • [24] Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps
    Wang, Zhitao
    Qiu, Yubin
    Zhou, Shiyu
    Tian, Yanfa
    Zhu, Xiangyuan
    Liu, Jiying
    Lu, Shengze
    ENERGIES, 2025, 18 (01)
  • [25] Improving exploration efficiency of deep reinforcement learning through samples produced by generative model
    Xu, Dayong
    Zhu, Fei
    Liu, Quan
    Zhao, Peiyao
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185
  • [26] Collaborative Deep Reinforcement Learning for Joint Object Search
    Kong, Xiangyu
    Xin, Bo
    Wang, Yizhou
    Hua, Gang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7072 - 7081
  • [27] Learning Mobile Manipulation through Deep Reinforcement Learning
    Wang, Cong
    Zhang, Qifeng
    Tian, Qiyan
    Li, Shuo
    Wang, Xiaohui
    Lane, David
    Petillot, Yvan
    Wang, Sen
    SENSORS, 2020, 20 (03)
  • [28] Multitask Augmented Random Search in deep reinforcement learning
    Thanh, Le Tien
    Thang, Ta Bao
    Van Cuong, Le
    Binh, Huynh Thi Thanh
    APPLIED SOFT COMPUTING, 2024, 160
  • [29] Enhancing Network by Reinforcement Learning and Neural Confined Local Search
    Hu, Qifu
    Li, Ruyang
    Deng, Qi
    Zhao, Yaqian
    Li, Rengang
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2122 - 2132
  • [30] Enhancing arrhythmia prediction through an adaptive deep reinforcement learning framework for ECG signal analysis
    Serhani, Mohamed Adel
    Ismail, Heba
    El-Kassabi, Hadeel T.
    Al Breiki, Hamda
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101