Deep Similarity-Based Batch Mode Active Learning with Exploration-Exploitation

被引：36

作者：

Yin, Changchang ^{[1
]}

Qian, Buyue ^{[1
]}

Cao, Shilei ^{[1
]}

Li, Xiaoyu ^{[1
]}

Wei, Jishang ^{[2
]}

Zheng, Qinghua ^{[1
]}

Davidson, Ian ^{[3
]}

机构：

[1] Xi An Jiao Tong Univ, Xian, Shaanxi, Peoples R China

[2] HP Labs, 1501 Page Mill Rd, Palo Alto, CA 94304 USA

[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA

来源：

2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2017年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICDM.2017.67

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Active learning aims to reduce manual labeling efforts by proactively selecting the most informative unlabeled instances to query. In real-world scenarios, it's often more practical to query a batch of instances rather than a single one at each iteration. To achieve this we need to keep not only the informativeness of the instances but also their diversity. Many heuristic methods have been proposed to tackle batch mode active learning problems, however, they suffer from two limitations which if addressed would significantly improve the query strategy. Firstly, the similarity amongst instances is simply calculated using the feature vectors rather than being jointly learned with the classification model. This weakens the accuracy of the diversity measurement. Secondly, these methods usually exploit the decision boundary by querying the data points close to it. However, this can be inefficient when the labeled set is too small to reveal the true boundary. In this paper, we address both limitations by proposing a deep neural network based algorithm. In the training phase, a pairwise deep network is not only trained to perform classification, but also to project data points into another space, where the similarity can be more precisely measured. In the query selection phase, the learner selects a set of instances that are maximally uncertain and minimally redundant (exploitation), as well as are most diverse from the labeled instances (exploration). We evaluate the effectiveness of the proposed method on a variety of classification tasks: MNIST classification, opinion polarity detection and heart failure prediction. Our method outperforms the baselines with both higher classification accuracy and faster convergence rate.

引用

页码：575 / 584

页数：10

共 50 条

[1] A similarity-based neighbourhood search for enhancing the balance exploration-exploitation of differential evolution
Segredo, Eduardo
Lalla-Ruiz, Eduardo
Hart, Emma
Voss, Stefan
COMPUTERS & OPERATIONS RESEARCH, 2020, 117
[2] Stream-based Joint Exploration-Exploitation Active Learning
Loy, Chen Change
Hospedales, Timothy M.
Xiang, Tao
Gong, Shaogang
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1560 - 1567
[3] Bayesian Deep Learning based Exploration-Exploitation for Personalized Recommendations
Wang, Xin
Kadioglu, Serdar
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1715 - 1719
[4] Adaptive batch mode active learning with deep similarity
Zhang, Kaiyuan
Qian, Buyue
Wei, Jishang
Yin, Changchang
Cao, Shilei
Li, Xiaoyu
Cao, Yanjun
Zheng, Qinghua
EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (04)
[5] Adaptive Exploration-Exploitation Active Learning of Gaussian Processes
Kontoudis, George P.
Otte, Michael
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9448 - 9455
[6] Social Learning and the Exploration-Exploitation Tradeoff
Mintz, Brian
Fu, Feng
COMPUTATION, 2023, 11 (05)
[7] Safe Active Dynamics Learning and Control: A Sequential Exploration-Exploitation Framework
Lew, Thomas
Sharma, Apoorva
Harrison, James
Bylard, Andrew
Pavone, Marco
IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (05) : 2888 - 2907
[8] Similarity-based active learning methods
Sui, Qun
Ghosh, Sujit K.
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
[9] Recursive Similarity-Based Algorithm for Deep Learning
Maszczyk, Tomasz
Duch, Wlodzislaw
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 390 - 397
[10] A Novel Active Learning Regression Framework for Balancing the Exploration-Exploitation Trade-Off
Elreedy, Dina
Atiya, Amir E.
Shaheen, Samir, I
ENTROPY, 2019, 21 (07)

← 1 2 3 4 5 →