Deep Similarity-Based Batch Mode Active Learning with Exploration-Exploitation

被引:36
|
作者
Yin, Changchang [1 ]
Qian, Buyue [1 ]
Cao, Shilei [1 ]
Li, Xiaoyu [1 ]
Wei, Jishang [2 ]
Zheng, Qinghua [1 ]
Davidson, Ian [3 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Shaanxi, Peoples R China
[2] HP Labs, 1501 Page Mill Rd, Palo Alto, CA 94304 USA
[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICDM.2017.67
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning aims to reduce manual labeling efforts by proactively selecting the most informative unlabeled instances to query. In real-world scenarios, it's often more practical to query a batch of instances rather than a single one at each iteration. To achieve this we need to keep not only the informativeness of the instances but also their diversity. Many heuristic methods have been proposed to tackle batch mode active learning problems, however, they suffer from two limitations which if addressed would significantly improve the query strategy. Firstly, the similarity amongst instances is simply calculated using the feature vectors rather than being jointly learned with the classification model. This weakens the accuracy of the diversity measurement. Secondly, these methods usually exploit the decision boundary by querying the data points close to it. However, this can be inefficient when the labeled set is too small to reveal the true boundary. In this paper, we address both limitations by proposing a deep neural network based algorithm. In the training phase, a pairwise deep network is not only trained to perform classification, but also to project data points into another space, where the similarity can be more precisely measured. In the query selection phase, the learner selects a set of instances that are maximally uncertain and minimally redundant (exploitation), as well as are most diverse from the labeled instances (exploration). We evaluate the effectiveness of the proposed method on a variety of classification tasks: MNIST classification, opinion polarity detection and heart failure prediction. Our method outperforms the baselines with both higher classification accuracy and faster convergence rate.
引用
收藏
页码:575 / 584
页数:10
相关论文
共 50 条
  • [1] A similarity-based neighbourhood search for enhancing the balance exploration-exploitation of differential evolution
    Segredo, Eduardo
    Lalla-Ruiz, Eduardo
    Hart, Emma
    Voss, Stefan
    COMPUTERS & OPERATIONS RESEARCH, 2020, 117
  • [2] Stream-based Joint Exploration-Exploitation Active Learning
    Loy, Chen Change
    Hospedales, Timothy M.
    Xiang, Tao
    Gong, Shaogang
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1560 - 1567
  • [3] Bayesian Deep Learning based Exploration-Exploitation for Personalized Recommendations
    Wang, Xin
    Kadioglu, Serdar
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1715 - 1719
  • [4] Adaptive batch mode active learning with deep similarity
    Zhang, Kaiyuan
    Qian, Buyue
    Wei, Jishang
    Yin, Changchang
    Cao, Shilei
    Li, Xiaoyu
    Cao, Yanjun
    Zheng, Qinghua
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (04)
  • [5] Adaptive Exploration-Exploitation Active Learning of Gaussian Processes
    Kontoudis, George P.
    Otte, Michael
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9448 - 9455
  • [6] Social Learning and the Exploration-Exploitation Tradeoff
    Mintz, Brian
    Fu, Feng
    COMPUTATION, 2023, 11 (05)
  • [7] Safe Active Dynamics Learning and Control: A Sequential Exploration-Exploitation Framework
    Lew, Thomas
    Sharma, Apoorva
    Harrison, James
    Bylard, Andrew
    Pavone, Marco
    IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (05) : 2888 - 2907
  • [8] Similarity-based active learning methods
    Sui, Qun
    Ghosh, Sujit K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [9] Recursive Similarity-Based Algorithm for Deep Learning
    Maszczyk, Tomasz
    Duch, Wlodzislaw
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 390 - 397
  • [10] A Novel Active Learning Regression Framework for Balancing the Exploration-Exploitation Trade-Off
    Elreedy, Dina
    Atiya, Amir E.
    Shaheen, Samir, I
    ENTROPY, 2019, 21 (07)